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Abstract 

Let (X„)„>o be an irreducible, aperiodic, homogeneous Markov chain, with state space an ordered 
hnite alphabet of size m. Using combinatorial constructions and weak invariance principles, we 
obtain the limiting shape of the associated RSK Young diagrams as a multidimensional Brownian 
functional. Since the length of the top row of the Young diagrams is also the length of the longest 
(weakly) increasing subsequence of (Xk)i<k<m the corresponding limiting law follows. We relate 
our results to a conjecture of Kuperberg by showing that, under a cyclic condition, a spectral 
characterization of the Markov transition matrix delineates precisely when the limiting shape is 
the spectrum of the in x m traceless GUE. For m = 3, all cyclic Markov chains have such a 
limiting shape, a fact previously only known for m — 2. However, this is no longer true for m > 4. 
In arbitrary dimension, we also study reversible Markov chains and obtain a characterization of 
symmetric Markov chains for which the limiting shape is the spectrum of the traceless GUE. To 
hnish, we explore, in this general setting, connections between various limiting laws and spectra of 
Gaussian random matrices, focusing in particular on the relationship between the terminal points 
of the Brownian motions, the diagonals of the random matrix, and the scaling of the off-diagonals, 
a scaling we conjecture to be a function of the spectrum of the covariance matrix governing the 
Brownian motion. 
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1 Introduction 

The limiting distribution of the length of the longest increasing subsequence of a random permuta- 
tion, was first determined by Baik, Deift, and Johansson [T], who identified it as the Tracy- Widom 
distribution, which describes the limiting behavior of the largest eigenvalue of the Gaussian unitary 
ensemble (GUE), as the size of the GUE goes to infinity. 

The identification of the limiting distribution of LI n , the length of the longest (weakly) increasing 
subsequence of a random word of length n, whose letters are iid and chosen uniformly from an ordered, 
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m-letter alphabet, was first made by Tracy and Widom [28]. They showed that the limiting distribution 

of LI n , properly centered and normalized, is that of the largest eigenvalue of the traceless mxm GUE. 
In the non-uniform iid case, Its, Tracy, and Widom [20\ [2Tj described the corresponding limiting 
distribution as that of the largest eigenvalue of one of the diagonal blocks (corresponding to the 
highest probability) in a direct sum of certain independent GUE matrices. The number and respective 
dimensions of these matrices are determined by the multiplicities of the probabilities of choosing the 
letters, and the direct sum is subject again to an overall zero-trace type of condition. 

The well-known Robinson-Schensted-Knuth (RSK) correspondence between sequences and pairs 
of Young diagrams led Tracy and Widom |28| to conjecture that the (necessarily m-row and of the 
same shape) Young diagrams of a random word generated by an m-letter, uniform iid sequence have 
a limiting shape given by the joint distribution of the eigenvalues of a m x m traceless element of 
the GUE. Since the length of the longest row of the Young diagrams is precisely LI n , this appears 
to be a natural generalization. Johansson [22] proved this conjecture using orthogonal polynomial 
methods. Further, Okounkov |27| . and Borodin, Okounkov, and Olshankii [6], as well as Johansson 
[22] . also answered a conjecture of Baik, Deift, and Johansson [21 13] regarding the limiting shape of 
the Young diagrams associated to a random permutation of {1, 2, . . . , n}. In particular, as n grows 
without bound, the lengths Ax, A2, . . . , Xk of the first k rows of the Young diagrams, appropriately 
centered and scaled, have the same limiting law as the k largest eigenvalues of a n x n element of the 
GUE, a result first proved, for k = 2, in [21 [3]. 

Extensions to the non-uniform iid case were addressed in Its, Tracy, and Widom [201 121] . who 
focused primarily on the top row of the Young diagrams. Here the obvious conjecture is that the 
limiting shape has rows whose suitably centered and normalized lengths have a joint distribution 
which is that of the whole spectrum of the direct sum of GUE matrices described above. Below, we 
prove this result as a special case of the Markovian framework. 

Kuperberg [21] conjectured that if the word is generated by an irreducible, doubly-stochastic, 
cyclic Markov chain, then the limiting distribution of the shape is still that of the joint distribution 
of the eigenvalues of a traceless mxm element of the GUE. For m = 2, this was shown to be true 
by Chistyakov and Gotze [9] (see also [H]), who, in view of further simulations, expressed doubts 
concerning the validity for m > 4. For m = 3, we will show that the conjecture holds as well. 
However, for m > 4, this is no longer the case. Indeed, some, but not all, cyclic Markov chains lead 
to a limiting law as in the iid uniform case already obtained by Johansson [22] , 




The precise class of homogeneous Markov chains with which Kuperberg's conjecture is concerned 
is more specific than the ones we shall study. The irreducibility of the chain is a basic property 
we certainly must demand: each letter has to occur at some point following the occurrence of any 
given letter. Moreover, the doubly-stochastic hypothesis ensures that we have a uniform stationary 
distribution. However, the cyclic (also called circulant) criterion, i.e., the Markov transition matrix 
P has entries satisfying pij = pi + \j + i, for 1 < i,j < m (where m + 1 = 1), is more restrictive: 
cyclicity implies but is not equivalent to P being doubly stochastic. Starting from a free-probability 
perspective, Kuperberg was led to introduce this cyclicity restriction via simulations [24] which appear 
to show that at least some irreducible, doubly-stochastic, non-cyclic Markov chains do not produce 
such limiting behavior. 

The paper is organized in the following manner. In Section 2, we present a combinatorial formu- 
lation of the LI n problem and so obtain a functional of combinatorial quantities which describes the 
shape of the entire Young diagrams with n cells, along with a concise expression for the associated 
asymptotic covariance structure. In Section 3, we apply Markovian Invariance Principles to express 
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the limiting shape of the Young diagrams as a Brownian functional for all irreducible, aperiodic, ho- 
mogeneous Markov chains (without the cyclic or even the doubly-stochastic constraint.) Using this 
functional we are then able to answer Kuperberg's conjecture. In Section 4, we investigate, in further 
detail, various symmetries exhibited by the Brownian functional and in particular obtain, for m arbi- 
trary, a necessary and sufficient condition for a cyclic Markov chain to have the same limiting law as 
in the iid uniform case. Still for m arbitrary, a related characterization is also obtained for reversible 
Markov chains with symmetric transition matrices, i.e. in the doubly stochastic case. In Section 5, 
we further explore connections between the various Brownian functionals obtained as limiting laws 
and eigenvalues of random matrices. In particular, we relate the behavior of the terminal points of 
the Brownian motions to the diagonals of certain Gaussian random matrices, conjecturing that the 
off-diagonals scale according to some yet to be determined function of the spectrum of the covariance 
matrix governing the Brownian motions. Finally, in Section 6, we conclude with a brief discussion of 
natural extensions and complements to some of the ideas and results presented in the paper. 

2 Combinatorics 

The combinatorial development for the m- letter alphabet, as obtained in [17] , resulted in m — 1 partial 
sums S^, 1 < r < m — 1 . Using an even more straightforward development which involves m quantities 
instead, we can obtain more symmetric expressions for LI n . This is done next, and will prove useful 
when studying the shape of the whole RSK Young diagrams. 

Let a r k be the number of occurrences of a r among {Xi)i<i<k- Each increasing subsequence of 
(-Xj)i<j<fc consists simply of consecutive identical values, with these values forming an increasing 
subsequence of a r . Moreover, the number of occurrences of a r G {a\, . . . ,a m } among (Xi)k+i<i<t, 
where 1 < k < £ < n, is simply a£ — a r k . The length of the longest increasing subsequence of 
X\ , X2, ■ ■ ■ , X n is thus given by 

LI n = max [(al 1 -a 1 ) + (al 2 -4 1 ) + --- + (a™-a% m _ 1 )}, (2-1) 
0<fei<— 
<fc m _i<n 

i.e., 

LI n = max [(4 i -al i ) + (al 2 -a 3 fc2 ) + ... + (a-- 1 i -a^_ 1 ) + a™], (2.2) 
0<fci<— 
<fc m _i<n 

where aJJ = 0. 

Moving beyond the purely combinatorial setting, assume that (Xk)k>o is an infinite sequence gen- 
erated by an irreducible homogeneous Markov chain having a stationary distribution {tv\, 7T2, . . . , vr m ). 
(For no k > is the law of X^ necessarily assumed to be the stationary distribution.) For each 
1 < r < m, set TT = a r k — Tr r k, for k > 1, and Tq = 0. Beginning again with (|2.ip . we find that 
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LI n = max 

0<iti<- 
<k m -i<n 



max 

0<fci<- 

<fc m _i<n 



« - al) + « - afj + ■ ■ ■ + « - < m _ J 

((T4 + Trafci) - (Tfe + ttiAo)) + {{Tl + vr 2 fc 2 ) - {Tl + tt^)) 



+ • • • + + v m k m ) - (T fe ™_ a + 7r m A; m _i)) 

- T fc M + {Tl —Tj* )-\ h (T fc m - T fc m / 



max 

0<fci<-- 

<fc m _i<n 



+ 7ri(fci - fe ) + vr 2 (A;2 - k{) H h 7r m (A; m - fe m _i) 

Setting 7r max = max{7ri, 7r 2 , . . . , 7r m }, (|2.3p becomes 



(2.3) 



LI n -ir max n= max V] [(IT - 7T ) + (7r r - ir max )(k r - k r -i)] ■ (2.4) 
0=fe o <fci<- ~[ 

For a uniform stationary distribution, 7r maa; = ir r = 1/m, for all r, and (j2.4j) simplifies to 

m 

LI n = max V(T fc r - T r k ). 

m o=fc <fei<- ^ 

To introduce a random walk formalism into the picture, we next set, for i = 1, . . . ,n and r 
1,2,... , m, 



(2.5) 



1, if Xi = Or, 
0, otherwise. 



(2.6) 



Clearly, a\ = Y* = l W L and so T fc = Eti W - vr r ), for 1 < r < m. 

To understand the limiting law of (|2.4p or (|2.5p . we must have a more precise description of the 
underlying Markovian structure. To that end, let p rs = P(X^ + i = a s \Xk = a r ), and let P = (p rs ) be 
the associated Markov transition matrix. In this setting, 



(k) 

Moreover, as usual, let prj denote the fc-step transition probability from a r to a s ; its associated 
transition matrix is simply P k . 

Assume now that the law of Xq is the stationary distribution. Thus, by construction, ET£ = for 
all 1 < r < m and 1 < k < n, and our primary task is to describe the covariance structure of these 
random variables TT. First, for k > 1, 



k-l 



k ii— 1 



VarII = ^VarW i r + ^ £ Clover, W Q + £ E Cov ^' W l 



(2.7) 



i=i 



il=l i2=«l+l 



11=2 22 = 1 
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Next, by stationarity, and since W[ is, simply, a Bernoulli random variable with parameter ir r , (j2.7|) 
becomes 



k fe— 1 k k ii— 1 

VarT^ = ^Var^+^ £ Cov(W r , W£ + E £ Cov ^ 

i=l ii=li2=ii+l ii=2i2=l 

fe— 1 A; A; ij— 1 

= A*r P (i-7r r ) + E (^r^-^ + EE^r 2 '-^) 

*1=1*2=«1+1 ii =2*2=1 

fe— 1 fc fc 11— 1 

= A:^ r - A; 2 ^ + 7r r E E ^P i2 ~ h eJ + vr r E E e rP h ~ i2 e J r ', (2.8) 
il=l i2=ii+l ii=2i2=l 

where e r = (0, 0, . . . , 0, 1, 0, . . . 0) is the r t/l standard basis vector of M m . Setting 

fc-1 k k 
H=l*2=*l+1 i=l 

we can rewrite (j2.8|) in the simple form 

Var T r k = kir r - k 2 7r 2 + 2-K r e r Q k e^. (2.10) 

Our description of the covariance structure can now be completed using the above results. For 
r\ ^ T2 and k > 1, 

cov(r-,T-) = Ecov(w-,^) + E E cov(^,^) + EE Cov ( w u^; 2 ) 

i=l il = li2=ii + l ii=2 12=1 

=j2covw\wn + E E Cov W^ 2 -i,) + EE c^w^u) 

i=l ii=li2=«i+l ii=2i2=l 

fe— 1 k k i\ — 1 

= -/c7T ri 7T r2 + E E (^l^n,^ - TTriTra) + E E ( 7r ^Pr 2 1 ,ri i2) ~ ^1^2) 
ii=li2=ii+l ii=2i2=l 
fc— 1 fc fc ii— 1 

= -feV ri ^ 2 + ^ ri E E e ^ !2 " ii ^+^EE^ pil " !2e n 

il=li2=ii+l ii=2i2=l 

= —k 7r ri 7r r2 + 7r ri e ri (5fce r2 +7r r2 e r2 Qfc<. (2.11) 



Remark 2.1 Both (|2.10p and ()2.11j) appear to be asymptotically quadratic in k. However, since 
Qk = Yli=i(k ~ cancellations will show that when the Markov chain is irreducible and aperiodic, 
the order of the variance is, in fact, linear in k. 
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In order to further analyze the asymptotics of Qk, we first examine the Jordan decomposition of 
P for a very general class of transition matrices. 



Proposition 2.1 Let P be the m x m transition matrix of an irreducible, aperiodic, homogeneous 
Markov chain. Let P = S~ 1 AS be the Jordan decomposition of P, where the rows of S consist of the 
generalized left-eigenvectors of P with, moreover, the first row of S being the stationary distribution 
(7Ti,7T2, . . . ,vr m ), and where A = diag(l,A), with A being the (m — 1) x (m — 1) (lower) Jordan-block 
matrix. Then the first column of S 1-1 is (1,1,..., 1) T . 

Proof. Since P = S~ 1 AS, then PS^ 1 = S~ 1 A. Denoting the first column of S^ 1 by c\, we have 
Pc\ = c\. But since the rows of P sum to 1, we see that c\ = (1, 1, . . . , 1) T satisfies Pc\ = c\. Moreover, 
ci must be unique, up to normalization, since the irreducibility of P implies that the eigenvalue Ai = 1 
has multiplicity 1. Finally, since the inner product of the first row of S and the first column of S -1 is 
1, the correct normalization is indeed (1, 1, . . . , 1) T . ■ 

Returning to Qk, as given in (12.9p . and using Proposition l2.lt we then obtain: 

Theorem 2.1 Let (X n ) n >g be a sequence generated by an m-letter, aperiodic, irreducible, homoge- 
neous Markov chain with state space A m = {«l < • • • < a m }, transition matrix P, and stationary 
distribution (ni, 7T2, . . . , 7r m ). Let also the law of Xq be the stationary distribution. Moreover, for 
1 < r < m, let T£ = a r k — n r k, for k > 1, and Tq = 0, where a r k is the number of occurrences of a r 
among (Xi)i<i<k- Then, for 1 < r < m, 

Var T r 

lim — k - = TT r (1 + lerS^DSeJ) , (2.12) 

fc^oo k v 



and for t\ ^ r<i, 



lim Cov ^ Tk ,Tfc I = n ri e ri S~ l DSeT + ^e^S^DSeE, , (2.13) 

fc^OO k 2-1 



where P = S 1 AS = S 1 diag(l, A)S is the Jordan decomposition of P in Proposition \2.1\ and D = 
diag{— 1/2, A(J — A) -1 ). That is, the asymptotic covariance matrix of (Tl,T?, . . . jTF 1 ) is given by 

S = n + n(5 -1 D5) + (S^DSfU, (2.14) 

where II = diag(iri,TT2, ■ ■ ■ , 7T m ). 

Proof. Beginning with (12. 9p . we employ the Jordan decomposition of P in Proposition 12.11 to find 
that 

fc-l /fe-i \ / k-l \ 

Q k = J2(k - i){S- l ASy = S- 1 Y,( k ~ S = S^diag h(l), J^(fc - »)A* S, (2.15) 
i=l \i=l / V i=l J 

where h(X) := ^2k=i( n ~ k)^ k ■ Now h{l) = k(k — l)/2 is quadratic in k, while for A ^ 1, 

. A A(A fc -l) 
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so that h(X) is linear in k. Now, by the aperiodicity of P, the diagonals of the lower-triangular matrix 
A are all less than unity in modulus. We can thus write the matrix sum in (|2.15p as 

h(A) := kA(I -A)' 1 +o(k). 

The matrix can now be expressed as the sum of terms which are, respectively, quadratic and linear 
in k. Recalling, moreover, that the first row of S contains the stationary distribution, and that the 
first column of S^ 1 is (1,1,..., 1) T , we have 



Q k = S- 1 dmg(h(l),h(A))S, 

k 2 ( 1 ~ \ 

= — S _1 diag(l, 0, . . . , 0)5 + fcS^diag I A ( J - A) -1 J S + o(k) 



2 



/ 7T1 7T2 
7Q 7T2 



7T m \ 



+ fc5~ 1 D5 + o(A;). 



(2.16) 



Starting with the variance in (I2,10p . we now find that, for each 1 < r < m, 



Var T[ = k-n r — k 2 ir 2 + 2ir r e r Qke. 

. ^ 2 

k 



lT r - k 2 TT 2 . + 27T r (^Y 7Tr + fcer5 ' lDSe r^j + 

A:^ r (1 + 2e r S~ 1 DSel) + o(fc), 



from which the asymptotic result ()2. 12|) follows immediately. 

An identical development shows that, for r% ^ r 2 , (|2.11|) simplifies to 



(2.17) 



Cov(Tl\Tl 2 ) = -k 2 ir ri ir r2 + TT ri e ri Q k e^ 2 + 7r r2 e r2 Q fc e^ 

Q f k 1 'T"> "\ 

= -k 7T ri TT r2 + 7T ri I -^-7Tr 2 + ke ri S~ DSe r2 J 

+ vr r2 ^^"^ri + ke r2 S~ l DSe^^j + o(k) 
= k [Tx ri e ri S~ l DSe^ 2 + 7r r2 e r2 5 _1 D5e^ i ) + o(k), 



from which the asymptotic result (12.130 follows, and so does (I2.14D . 



(2.18) 



Remark 2.2 To see that (|2.12p and f)2. 13j) 6o£/i recover the covariance results for the iid case inves- 
tigated by the authors in \17$ , let P be the transition matrix whose rows each consist of the stationary 
distribution (tti, 7T2, . . . , ir m ). In this case A 2 = • • • = A m = 0, and so D = diag{— 1/2, 0, . . . , 0). Hence, 
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e^DSel = (!,*,...,*)£> (vr r2 , *, . . . , *) T = 



for all r\ and T2, and so, for each r, 



li m Var T k -,. ( i .2 f ) ) -,.( 1 - ~, 



while, for r\ ^ r<i, 



ifc 1 = ^ ( — ~ — —) = -in "ra- 



in i/ie uniform iid case, ir r = 1/m, for all 1 < r < m. Hence, for r\ ^ r<i, the asymptotic correlation 
between T^ 1 and T£ 2 is given by (— l/(m 2 ))/((l/m)(l — 1/m)) = — l/(m — 1), so that the covariance 
matrix is indeed the permutation- symmetric one obtained in [11]. There is, moreover, another Brow- 
nian functional representation for the iid uniform case in fl 7 ^ in which the Brownian motions have a 
tridiagonal covariance matrix. 



3 The Limiting Shape of the Young diagrams 

Thus far, our results have centered on LI n alone, essentially ignoring the larger question of the structure 
of the entire Young diagrams. The present section extends the combinatorial development of the 
previous section to answer the question of the limiting shape of the Young diagrams. Our first result 
in this direction is a purely combinatorial expression generalizing (12. ip . 

Theorem 3.1 Let R^^R^, ■ ■ ■ be the lengths of the first 1 < r < m rows of the RSK Young 
diagrams generated by the sequence (Xk) i<fe<n whose elements belong to an ordered alphabet *A m — 
{ai < • • • < a m }. Then, for each 1 < r < m, the sum of the lengths of the first r rows of the Young 
diagrams is given by 

r r m—r+j 

where J T ^ m = {{kj^, 1 < j < r, < I < m) : kj/ = 0, 1 < j < r, < £ < j — 1; kj/ = n, 1 < j < 
r,m — r + j < £ < m; A^_i < kj t g, 1 < j < r, 1 < £ < m; < kji, 1 < j < r — 1, < £ < m — 1}, 

and where a\, is the number of occurrences of at among (Xj)i<j<fc. 

Proof. Recall that the sum of the lengths of the first r rows of the RSK Young diagrams generated by 
a sequence (^/ c )i<A ; <r i , whose letters arise from an m-letter alphabet, has an interpretation in terms 
of the length of certain increasing sequences. Indeed, the sum R\ + R\ + • • • + R r n is equal to the 
maximum sum of the lengths of r disjoint, increasing subsequences of (Yfc)i<fc< n , where by disjoint it 
is meant that each element of {Xk)i<k<n occurs in at most one of the r subsequences. (See Lemma 
1 of Section 3.2 in [12]). More general results of this sort, involving partial orderings of the alphabet 
and associated antichains, are known as Greene's Theorem [16] . However, such results are not enough 
for our purpose. Below we need a different way of reconstructing disjoint subsequences. 
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We begin by examining an arbitrary collection of r disjoint, increasing subsequences of (-Xjfc)l<fc<n> 
and show that we can always map these r subsequences onto another collection of r disjoint, increasing 
subsequences whose properties will be amenable to our combinatorial analysis. 

Specifically, with the number of rows r fixed, suppose that, for each 1 < j < r, we have an 
increasing subsequence (X J J )i<£< n . of length rij < n, and that the r subsequences are disjoint. 

£ 

We first construct the new subsequence (Xh)^^^ as follows. First, place all ct\s occurring 

k e — — 

among the r original subsequences into (X| 1 )i<£<fi 1 , if there are any. If the last ot\ occurs at the 

k t — — 

n th mc j eX) then (XJ 1 )i<f<^ 1 is complete. Otherwise, place all 02s which occur after the final a\ into 

k e — — 

(Xh)i<t<ni, ^ there are any. If the last a2 occurs at the n th index, then {Xh)i<i<m is complete. 

k ( k t 

Otherwise, continue adding, successively, 03, . . . , a m _ r+ i in the same manner. Thus, (Xh)i<i<m 

k ( — — 

consists of a weakly increasing sequence of length hi having values in {a%, . . . , a m _ r +i}. 

Next, we construct the new subsequence (X? 2 )i<i<n 2 similarly. By considering only those letters 

k e 

among the r original subsequences which have not already been moved to the first new subsequence, 
start with the smallest available letter, 0-2, and continue adding, successively, 0-3, . . . , a m +r+2- Note 

that, crucially, all a^s added to (X? 2 )i<i<n 2 occur before the last index at which a\ was added to the 

1 

first subsequence. More generally, each otj, 2 < j < m — r + 2, added to (X? 2 )i<e<n 2 occurs before 

1 

the last CKj-i was added to the first subsequence. Thus, (^ 2 )i<^<n2 consists of a weakly increasing 

k e — — 

subsequence of length 77,2 having values in {02, ■ ■ ■ , a m - r +2}- 

The construction of (XL)i<^<n •, for 3 < j < r, continues in the same manner, with {X 3 tj )x<^<ni 

k t J ' k e J 

constructed from among the entries of the r original subsequences which were not moved into any of 
the first j — 1 new subsequences, so that (Xl 1 )i<g<n , consists of a weakly increasing sequence of length 

hj having values in {ay, . . . ,a m - r +j}- It is possible that beyond some j > 2 the new subsequences 
may be empty. 

We claim that, indeed, the construction of the r th new subsequence exhausts the set of available 
entries. Indeed, without loss of generality, assume that after we have created the (r — l) th new 
subsequence, the set of available entries is non-empty, and designate the location of the final to be 
included in the j th new subsequence by kj g, for 1 < j < r and 1 < i < m. (If no ag was available for 
inclusion, set kji = kjj>-i, where kj t o = 0, for all 1 < j < r.) Clearly, all ai,a2, ■ ■ ■ ,QV-i have been 
included in the first r — 1 new subsequences. If r = m, we are done: simply put the remaining a r s 
into the r th new subsequence. If r < m, we may still ask whether there was, for some r + 1 < £ < m, 
an ai from among the available entries which occurred before Av/-i- Assume that there is such an 
ag. Now by construction, kj + i^_ r+ j < fcj^_ r . + j_i, for 1 < j < r — 1. Hence, there exist letters 
aj 1 < ctj 2 < ••• < otj r < ag-\ among the original subsequences which occurred after /c r ^_i, and, 
moreover, each letter must come from a different subsequence. But since each original subsequence 
was increasing, none of them could have contained an ag before k rt g-i, and we have a contradiction. 

To better understand this construction, consider the first row of Figure [Tj which shows an initial 
sequence of length n = 12, with m = 4 letters, broken into r = 3 disjoint, increasing subsequences of 
lengths rii = 3, r?-2 = 4, and 713 = 3, and so with total length 10. The final three rows of the diagram 
show the results of the operations described above, producing 3 new increasing subsequences of length 
hi = 4, fi2 = 3, and 77,3 = 3. 
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Figure 1: Transformation of r = 3 subsequences. 
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Figure 2: Schematic diagram of J r ,mi for r = 4, m = 7. 



Hence, if we wish to find r disjoint, increasing subsequences whose length sum is maximal, it 
suffices to consider only those disjoint, increasing subsequences for which the final occurrence of the 
letter ae in the subsequence i happens after the final occurrence in the subsequence j, whenever i < j. 
Because such ranges do not overlap, if we wish to count the number of ags in the j th subsequence, it 
suffices to simply count the number of a s s in (X&)i<fc< n over that range. 

Indeed, returning to the fundamental combinatorial objects of our development, the a?,, we see 
that since a\ — a? k counts the number of ays in the range I + 1, . . . , k, we can describe the valid index 
ranges over which to search for the maximal sum as J r>m = {(£^,1 < j < r, < £ < m) : kj t i = 
0, 1 < j < r, < I < j — 1; kj £ = n, 1 < j < r,m — r + j < £ < m; kj^—\ < kjj, 1 < j < r, 1 < 
I < m; fcj+i^+i < kj g, 1 < j < r — 1, < £ < m — 1}. The constraints on the kjj follow simply from 
the fact that each subsequence is increasing and that, moreover, the intervals associated with a given 
letter do not overlap. Figure [2] indicates the relative positions of each range, for r = 4 and m = 7. 

Since the first possible letter of each subsequence grows from ot\ to a r , and the last possible letter 
grows from a m+r -i to a m , the result is proved. ■ 

We are now ready to apply our asymptotic covariance results (Theorem l2.ip . along with a Brownian 
sample-path approximation, to the combinatorial expression (13. ip . and so obtain a Brownian functional 
expression for the limiting shape of the Young diagrams for all irreducible, aperiodic, homogeneous 
Markov chains. 

Indeed, for each 1 < r < m, let the sum of the first r rows of the Young diagrams be given by 



r 



r m—r+j 





where the index set J r ^ m is defined as in Theorem l3.11 Define as before T£ = ^2i = i(W[ — ir r ) = a r k —TT r k, 
and so rewrite (13. 2p as 
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r m—T+j 

y n = ™ E E (fe + - K,,-! + 



r to— r+J 

= , m ^ E E (fe- r 4 ( 3 - 3 ) 

Next, let r be a permutation of the indices 1, 2, . . . , m such that 7r r m > ^(2) > ■ ■ ■ > ^rfml > 0- 
Moreover, we demand that if 7r T (j) = n T (j) for j < j, then r(«) < r(j). (The permutation so defined 
is thus unique.) Let v r = Yl^i^rij) be the sum of the r largest values among 7ri,7T2, . . . ,vr m . We 
obtain, below, the limiting distribution of (V* — v r n)j^fn as a Brownian functional. 

To introduce Brownian sample-path approximations, and for each 1 < r < m, we first define the 
asymptotic variance of as in (|2. 12[) , by 

Var T r 

al ■- Urn — — a = e r Se^, (3.4) 

n— >oo n 

and, for r\ ^ ri, the asymptotic covariance of T^ 1 and T£ 2 by 



a := Urn Cov ( T " 1 ' r " 2 ) = e n Se^, (3.5) 

where £ is the covariance matrix of Theorem 12.11 associated with the transition matrix P. For each 
1 < r < m, we then let 

Tf , + (nt - \nt])(Wr - vr r ) 
B r n (t) = 1 J !l M+1 (3.6) 

for < t < 1. This rescaling of [0,n] to [0, 1] calls for us to define a new parameter set over which 
we will maximize a functional arising from the expressions in (|3.6p . Indeed, for any positive integers 
s and d, with s < d, define the set 



I s ,d = {(he, l<j<s,0<£<d) :t u = 0,1 < j < a,Q < I < j - 1; 

tj^ = 1, 1 < j < s, d — s + j < I < d; 
tj,e-i < tjj, 1 < j < s, 1 < £ < d; 

t j+ i,e+i < tj,e, 1 < j < s - 1,0 < I < d- l}. 

Note that the constraints ijj— 1 = and tj^-s+j = 1 5 for 1 < J < s> force many of the t,^ to be zero 
or one. We will denote the sx(rf+ l)-tuple elements of I St d, by (i. ,.). Figure [3] shows the structure of 
I s d, for s = 4 and d = 7. The locations of tji are indicated by the horizontal lines within the diagram. 
With this notation, (|3,3p becomes 



T rr r m—T+j r m—r+j 

VI — nv r 



{E E ^(^y-^fe-i)) +E E v^(^-7rro-))(^-^-i)}- 

(3.7) 



- = max 

n (t.,.)G/r-,m . , . , 
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Figure 3: Schematic diagram of I St d, for s = 4, d = 7. 



Our analysis of (|3,7p will yield the following theorem, whose proof we defer to the conclusion of the 
section. This theorem gives, in particular, a full characterization of the limiting shape of the Young 
diagrams in the non-uniform iid case. 



Theorem 3.2 Let (X n ) n >Q be an irreducible, aperiodic, homogeneous Markov chain with finite state 
space A m = {a\ < ■ ■ ■ < a m }, transition matrix P, and stationary distribution {tt\,it2, ■ ■ ■ , v" m ). Let 
T, = (ov )S )i< riS < m be the associated asymptotic covariance matrix, as given in (|2.14p . and let the law 
of Xq be given by the stationary distribution. Let r be the permutation of {1,2, ... ,m} such that 
flY(i) > n r(i+i)> an d T (*) < T (i) whenever n T r^ = n T (j) and i < j. For each 1 < r < m, let V£ be 
the sum of the lengths of the first r rows of the associated Young diagrams, and let v r = Y2j=i ^rQj) ■ 
Finally, let d r be the multiplicity of Tr T r r \, and let 



m r 



0, if^r(r) = 7Tt(1)j 

max{i : tt t u\ > tt t ^}, otherwise. 



Then, for each 1 < r < m, 



V£ — nv T 



rn r 



n 



i=l 

r-mr (d r +m r -r+j) 



max ^ Yl ^( m ^)(^ T(mr+£) (^)- J B r(mr+ ' ) (^-i)), (3- 



where the first sum on the right-hand side of <\3.8h is understood to be 0, if m r = 0. Above, a 2 . = a r , r , 
and (B 1 (i) , B 2 (t) , . . . , B rn {t)) is an m- dimensional Brownian motion, with covariance matrix £ = 
(v r ,s)i<r,s<m given by 

(a r>s ) = t(a r , s )/a r a s , (3.9) 
for 1 < r, s < m. Moreover, for any 1 < k < m, 
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,3,0) 

Remark 3.1 The critical indices d r and m r in Theorem \3.2\ are chosen so that 

K T (m r ) > 7r T(m r +i) = 7Tr(r) = • • • = ^ T (m r +d r ) > ^T(m r +dr+l) • 

Thus, the functional in (13.8P consists of a sum of m r Gaussian random variables and a maximal 
functional involving only d r of the m one- dimensional Brownian motions. 

Remark 3.2 Another, perhaps more natural, way of describing the covariance structure of the Tri- 
dimensional Brownian motion in Theorem \ 3.S\ is to note that o"2-B 2 (t), . . . ,a m B m (t)) has 
covariance matrix tH . 

Let us now examine the case r = 1. Here, as previously noted, = LI n . Since mi = 0, (|3.8p 
becomes 



T T — dl 



max 



5> tW (B T <4(t M ) -B^(t M _ x )) , (3.11) 



n (*.,.)e/ Ml ^ 

where W6 hcive Written T^raax 

for 7iY(i) . The functional in (|3.1ip is similar to the one obtained in the 
iid case in [17], the essential difference being, not in the form of the Brownian functional, but rather 
in the covariance structure of the Brownian motions. 

To see precisely where this difference comes into play, note that if the transition matrix P is cyclic, 
then the covariance matrix of the Brownian motion is also cyclic. Consider then the 3-letter aperiodic, 
homogeneous, doubly-stochastic Markov case. Since the Brownian covariance matrix is symmetric, 
and, moreover, degenerate, an additional circularity constraint forces it to have the permutation- 
symmetric structure seen in the iid uniform case. In particular, LI n will have as a limiting law, up to 
a scaling factor, the maximal eigenvalue of the traceless 3x3 GUE: 



LI n -n/3 

— =^ a max 

V™ (t.,.)eli, 3 - 



3 



£ (B\t u )-B\t u _ x ) ), (3.12 



where a = o~e, for all 1 < I < 3, and with the Brownian covariance matrix given by 

1 -1/2 -l/2\ 
■1/2 1 -1/2 t, 
-1/2 -1/2 1 / 

and where we have used the fact that t(£) = £, for all 1 < I < 3. 

However, when m > 4, the cyclicity constraint does not force the Brownian covariance matrix to 
have the permutation-symmetric structure, as the following example shows for m = 4. 
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Example 3.1 Consider the following doubly- stochastic, aperiodic, cyclic transition matrix: 



P 



(OA 


0.3 


0.2 


0.1\ 


0.1 


0.4 


0.3 


0.2 


0.2 


0.1 


0.4 


0.3 


\0.3 


0.2 


0.1 


0.4/ 



(3.13) 



While the doubly- stochastic nature of P ensures that the stationary distribution is uniform, the covari- 
ance matrix of the limiting Brownian motion, at three- decimal accuracy, is computed to be 



( 1.000 
-0.357 
-0.287 

\-0.357 



-0.357 
1.000 
-0.357 
-0.287 



-0.287 
-0.357 
1.000 
-0.357 



-0.357\ 
-0.287 
-0.357 
1.000 J 



(3.14) 



and a 2 



a 2 := 0.263, for each 1 < r < 4. Thus, 



LI n - n/4 



n 



a max V ( B l (t x £) - B l (t x t 

1=3 



(3.15) 



for 1 < r < 4. However, while the form of the functional is the same as in the iid uniform case (up 
to the constant), the covariance structure of the Brownian motion in (|3.14|) differs from that of the 
uniform iid case, i.e., from 



1 


-1/3 


-1/3 


-l/3\ 


1/3 


1 


-1/3 


-1/3 


1/3 


-1/3 


1 


-1/3 


1/3 


-1/3 


-1/3 


1 / 



t. 



(3.16) 



and so the limiting distribution in (|3.15p is not that of the uniform iid case. 

We thus see that Kuperberg's conjecture regarding the shape of the RSK Young diagrams for 
random sequences generated by aperiodic, homogeneous, and cyclic matrices [23] is not true for general 
m-alphabets. By simply extending the first-row analysis above to the second and third rows, we see 
that it is true for m = 3. However, it fails for m > 4. In the next section we shall see that for the 
cyclic case the structure of £ can be described in an way which delineates precisely when the limiting 
law is the spectrum of the traceless GUE. 

In the more general doubly stochastic case, we have the following corollary: 

Corollary 3.1 Let the transition matrix P of Theorem \3.S\ be doubly stochastic. Then, for every 
1 < r < m, m r = 0, d r = m, and 



tt T I r m-r+j 

n =► (t EE" (B%,e) ~ B%,«-i)) • (3.17) 

3=1 t=] 

If, moreover, the matrix P has all entries of 1/m, then 
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VI. — rn/m \Jm — 1 

— =>• max 



r m—r+j 



E E (^(^) - B e (tj,e-i)) , (3.18) 



n m (t.,.)eir„„ 

and i/te covariance matrix in (|3.9p /ias its off- diagonals equal to — l/(m — 1). 

Proof. For each 1 < r < m, ir r = 1/m, and so f r = r/m, m r = 0, and the multiplicity d r = m. 
Moreover, the permutation r is simply the identity permutation. This proves (|3.17p . If, moreover, 
all the transition probabilities are 1/m, then the multinomial nature of the underlying combinatorial 
quantities a r k tells us that of = (l/m)(l — 1/m), for each 1 < r < m, and that p ri ,r 2 = ~ l/( m ~~ 1)) 
for each ri ^ r 2 , thus proving f|3. 18|) . ■ 

To see that the functional in (I3.17P is generally different from the uniform iid case, even for m = 3, 
consider the following non-cyclic example: 

Example 3.2 Let a doubly-stochastic (but non-cyclic), aperiodic Markov chain have transition matrix 

/OA 0.6 0.0\ 

P = 0.6 0.0 0.4 . (3.19) 
\0.0 0.4 0.6/ 

Again, the doubly-stochastic nature of P ensures that the stationary distribution is uniform, and in 
the present example, the asymptotic covariance matrix, at three-decimal accuracy, is computed to be 

0.459 0.049 -0.506\ 

0.049 0.086 -0.136 . (3.20) 
v -0.506 -0.136 0.642 / 

Note that, even though we have a uniform stationary distribution, the asymptotic variances (i.e., the 
diagonals of (I3.20p ) have dramatically different values. Moreover, according to Remark \2.S\ in the 
uniform iid case, the only possibility for the Brownian covariance matrix is that the off-diagonals have 
value —1/2. However, the Brownian motion covariance matrix obtained from (I3.20p is 

1.000 0.246 -0.935\ 

0.246 1.000 -0.577 t. (3.21) 
-0.935 -0.577 1.000 / 

Not only are the off-diagonals different from —1/2, but in some cases are even positive. In short, the 
functional in (|3.17p has a distribution which differs from any iid case (even non-uniform). 

Remark 3.3 Generalizing a result of Baryshnikov C]/ and of Gravner, Tracy, and Widom 115)/ on the 
representation of the maximal eigenvalue of an m x m element of the GUE, Doumerc [11] found a 
Brownian functional expression for all the eigenvalues of an mxm element of the GUE. Our expression 
in (I3.18P is similar, with the exception that our m- dimensional Brownian motion is constrained by a 
zero-sum condition, and, moreover, has a different covariance structure. (We note, moreover, that the 
parameters over which his Brownian functional is maximized in I JJj / might be intended to range over 
a slightly larger set which corresponds to our Ir,m-) Using a path-transformation technique relating 
the joint distribution of a certain transformation of n continuous processes to the joint distribution of 
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the processes conditioned never to leave the Weyl chamber, O'Connell and Yor 126]/ employed queuing- 
theoretic arguments to obtain different types of representations for the entire spectrum of the m x m 
element of the GUE. In a study of much more general transformations of this type, Bougerol and 
Jeulin J?]? obtained this result as a special case and moreover these two representations are shown to 
be equivalent to each other in Biane, Bougerol and O'Connell f^. In both these works, the maximal 
and minimal eigenvalues have the same representations and it is also the same as the one given by 
our Brownian functionals. However, our representation for the rest of the spectrum is different from 
theirs and the equivalence with the works cited above is not that immediate. Indeed, in our case, a 
single supremum is taken over a rather involved set while in theirs, sequencees suprema and infiraa 
are taken over much simpler sets. Another piece of work of interest is reported by Kirillov I23f . where 
minima and maxima can be disentangled via classes of linear transformations (see Theorem 3.3 and 
Theorem 3.5 in JS^). 

If d r = 1, i.e., if the r th most probable state is unique, then the following result can be viewed as 
lying at the other extreme from Corollary 13.11 

Corollary 3.2 Let 1 < r < m, and let d r = 1 in Theorem \3.2\ Then 



' : ;"--E^(.»B' (,) (l). (3.22) 



Proof. If d r = 1, then m r = r — 1, and so the maximal term of (|3.8p contains only one summand 
namely CT T ( mr+ i)-B T ( mr+1 < l (l) = (T T / r )B T ( r \l). Including this term in the first summation term of (13 . 8 1 
proves (pH2l) . I 



Remark 3.4 The maximal term of the functional in (|3.8p is that of the doubly-stochastic, d r -letter 
case. Indeed, the maximal term involves precisely d r Brownian motions over the r — m r rows. Such a 
functional would arise in a doubly-stochastic d r -letter situation with a covariance matrix consisting of 
the sub-matrix of the original T, corresponding to the d r Brownian motions, as in Corollary \3.1\ The 
Gaussian term corresponds to the functional of Corollary \3.2\ That is, in some sense, the limiting law 
of (|3.8h interpolates between these two extreme cases. 

Proof. (Theorem 13.21) Since the r = m case is trivial (V™ is then identically equal to re), assume 
that r < m. Recall the approximating functional (|3,7p : 



yr _ nv ( r m_r +i r m-r+j . 

" = max E °* ~ Biihe-i)) +^ ^ Vn"(vr, - 7r r(i) ) (t j>t - t^ x ) . 

Vn Ir ' m L i=i e=j ,=i t=3 J 

(3.23) 

Introducing the notation Atjj := and M^(Atj^) := M^(tj^) — M^tj^-i), for any Tri- 

dimensional process M(t) = (M 1 (t) , M 2 (t) , . . . , M m (t)) , t G [0,1], we can rewrite (|3.23j) more com- 
pactly as 
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- r m—r+j r m—r+j 

V' — nv r 



n 



™ ax lE Y ^Bi(At M ) - Y (^0)-vr,)|A^|i- (3-24) 

r ' m l i=i <=j i=l <=i J 



The main idea of the proof to follow will be to show that the second summation of (|3.24p can, 
in effect, be eliminated by choosing the (Atj t e) in an appropriate manner. Now some of the coeffi- 
cients (ir T rjS %i ) are zero; such terms do not cause any problems. Intuitively, however, the remaining 
terms should have |Atj^| = 0. Defining the restricted set of parameters I* m = 6 I r ,m ' 

Y7j=i Y^£=j v 71 "^ ~~ n T(j))\Atj t i\ = 0, 1 < £ < m}, we see that, provided I* m ^ 0, 

r m—r+j r m—r+j 

max^ ^2 (°"^(A^) - y/n(n T (j) - Tv e ) \AtjA > max^] ^ c^B^Ai^). (3.25) 

3=1 £=j 3=1 1=3 

Moreover, by the Invariance Principle and the Continuous Mapping Theorem, 

r m—r+j r m—r+j 

^Y Y ^(Ay^max^ Yl <reB e (At jt e). (3.26) 

r,m j = 1 £=j r,m j = l £=j 

We claim that, indeed, I* m ^ 0, and that, moreover, 



r m—r+j r m—r+j 

^Y Y ( a i^i( A he) - Vn (TT T (j) - n) \AtjA =^max^ ^ a e B £ (At j>e ). (3.27) 

We will prove that I* m / by creating a bijection between I* m and I r -m r ,d r - To this end, for 
1 < £ < m r , let I T u) i = [u T (i),i-i> u T(i),i] = [0, !]• Next, choose any (u v ) 6 I r -m r ,d r , and define further 
intervals I T ( mr +j),i = Auj^, for 1 < j < r — m r and 1 < ^ < d r . 

We now create a partition of these intervals in a manner which relies on the ideas used in the proof 
of Theorem 13. 1[ Consider the set of points {uj > £}n<j< r - mr i<£<d r )i and order them as sq := < s\ < 
■ ■ ■ < < s K := 1, for some integer k, and let As q = [s ? _i, s q ], for all 1 < q < K. 

Trivially, for each 1 < q < k, and for each 1 < i < m r , As g C I T (i),i- Moreover, for each 
1 < j < r — m r , there exists a unique £(j,q) such that As g C I T {m r +j),i(j,q)- For each q, consider the 
set of indices A q := {r(l), . . . , r(m r )} U {T(m r + £(1, g)), . . . , r(m r + £(r — m r , q))}, and order these r 
elements of A q as 1 < £(1, g) < • • • < ^(r, q) < m. 

Using these partitions, we examine, with foresight, the following functional of a general m-dimensional 
process (M(t)) t > : 

m r (r-m r ) (r—m r +d r -l) 

^M r «(l)+ Y M T( - mr+e \A Uj/ ) (3.28) 

i=i j=i i=j 
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m r / k 



( 



(r—m r ) (r— m r +d r — 1) 

E l e at« (A Sg ) I + E E E MT(m ^ ( As ? ) 

i=l \g=l / j=l £=j 



re / m r (V— m r ) 

= E E Mr(4) ( As ^ + E M r(mr+ ^ ,,?)) (As g ) 

3=1 \i=l j=l 

= EE^(^) =EE m ^)(a s?) 

9=1 j=i i=i 9=1 

r r 

= EE M%9) ( A ^)' ( 3 - 29 ) 

where, for each 1 < j < r, and for each 1 < i < m, tj t £ := max{sg : I > ^(j, g)}. (That is, for each j, we 
collapse together intervals As q corresponding to the same component M .) Now, since our functional 
in (|3.29p has non-trivial summands only for I such that ^ T (C) — ^tMi we have shown that (i v ) S I* m - 
The following example illustrates this argument. Suppose we have an alphabet of size m = 8, with 

(vri, vr 2 , . . . , 7rg) = (0.07, 0.1, 0.2, 0.06, 0.2, 0.06, 0.1, 0.2). 

Then, 

7Tt(1) = Tt(2) = ^(3) = 0- 2 ) mi= m 2 = m 3 = 0, 4=4=4= 3, 

vr r (4) = 7T T ( 5) = 0.1, m 4 = m 5 = 3, 4 = 4 = 2, 

7r r{6) = 0.07, m 6 = 5, 4 = 1, 

7r T (7) = vr T ( 8 ) = 0.06, mj = ms = 6, 4 = 4 = 2. 

In particular, note that the two largest, distinct probability values are 0.2 and 0.1, of multiplicities 
3 and 2, respectively. Next, consider the case r = 4. We now show how I r -m r ,d r = 4-3,2 = 4,2 
corresponds to an element of I* m = J| g . Figure H] shows a typical element of the unconstrained index 
set 4,8- 

Now t(1) = 3,t(2) = 5, t(3) = 8,r(4) = 2, and r(5) = 7. Our construction begins with the 
amalgamation of m r = 7714 = 3 rows, corresponding to the three indices for which 7Tj is strictly less 
than 7r T ( r ) = 7r T (4) = 0.1, with 4,2- This is shown in Figure [5j 

Finally, we simply reorder each vertical column in the original order of the indices, as shown in 
Figure EJ We see that, first of all, we have constructed an element of 4 s- Moreover, since we have 
three rows whose indices are associated with the maximum value, and a remaining row whose indices 
are associated with ir T u-\, we indeed have an element of 4 8 . Note that the 4 x 4 = 16 free indices in 
4,8 (corresponding to the locations of the 16 vertical bars in Figure HJ) have been reduced to a single 
one in I| 8 . 

In addition, we may essentially reverse this construction, starting with an element of I* m (7^ 0), 
and so obtain an element of I r -m r ,d r - Indeed, from the definitions of I* m and v r we know that 

r r m—r+j 

u r = E ^(j) = E E ^1 A *J.*I> 

j=l 3=1 1=3 



19 
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5 


2 


3 


4 


5 


6 


3 


4 


5 


6 


7 


4 


5 


6 


7 


8 



Figure 4: A typical element of I^g. 




Figure 5: Amalgamating 3 rows with 7^2. 




Figure 6: Reordering vertically to obtain an element in 7| 8 . 
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for any G ^* >m - However, we also have 

r m—r+j r m—r+j 

Y Y n i\ At jM = 1 {mr>0}{Yl E 1 {T.W>TT(m r) }^l A ^l 
j=l l=j 3=1 l=j 

r m—r+j 

+E E 1 w w <^ (mr) }^i A ^i) 

r m—r+j 
+ l{m r =0}7rr(l) E E ' At ^l 

< 1 {7« r >0}(( 7r r(l) H 1" 7Tr(m r )) + ( r ~ "V)*V(r)) + l{ mr=0 }^7rr(l) 

= ^r, 

with equality holding throughout if and only if m r = or m r > and Y2j=i \ = 1> f° r an ^ such 

that vt tW > vr T ( mr) , and that, moreover, YJj=i HT=J +J 1 {^ ( o= 7r r M }l At i/I = r ~ m r- If "V > 0, then, 
for any (i. r ) G I* m , we may start with (|3.29p . and use again the permutation of the indices employed 
there. We thus obtain the first term of (|3.28p . which corresponds to the condition Y^j=i l A A?/l = ^ 
for all £ such that tt t ^ > 7r T ( mr ), and also the second term of ()3.28p . which corresponds to the other 
condition Y7j=i ^2T=j T+J ^-{^ T (i)=^ T ( r )}\^jA = r — m r . If m r = the same reasoning holds, except 
that the first term in (|3.28|) is taken to be zero. 

Having thus established a bijection between I* m and I r -m r ,d r i we ma y thus maximize over these 
two parameter sets, and so, for any process (M(t))t>o, obtain the general result 



m r (r-rrir) (r-m r +d r -l) r m-r+j 

^M T «(1)+ max Y E M T ^ +e \A Uj/ ) = maxY Y M l ^ q \At jA ). (3.30) 

i=l Ir-mr.dr j=l g _. #,m j = 1 £=1 

We now proceed to show that ()3.27|) holds. First, fix c > 0, and, for each 1 < £ < m, set 

C ( =h [i ^<^(r), (331) 

I 0, otherwise. 

Next, let Mi{t) = o t B l n (t) - c e t, and let M e (t) = a e B e (t) - c e t. Then, for n large enough, namely, 
for n > c/(7r T ( r ) — vr T ( r+1 )), we have that, almost surely, for any (i r ) G I r ,m, 



r m—r+j r m—r+j 

Y E Mi(At jj£ )>Y E (^n(A^)-^K )-vr £ )|A^|). (3.32) 
j=i 1=1 j=i e=j 



Hence, almost surely, both 



r m—r+j r m—r+j 

maxY Y ^ni^u) > maxj^ Y (°*^( A *A*) ~ ^ - m) |A^|) , (3.33) 

j=l 1=1 r ' m j=l i=j 
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and 



r m—r+j r m—r+j 

^ aX E E ^( A ^) = ™ aX E E ^n(At^). (3.34) 

Now choose any z > 0. Then 

r m— r+j r m—r+j 



™ ax E E {^BiiAtjj) - y/n{n T{j) - ir e ) | Ai^| J - max ^ ^ ^(As^) >z 

I r m—r+j r m—r+j \ 

<P max^ Yl Mi(At j>e ) - max^ £ M„(A^) > * , (3.35) 



3=1 1=1 



so that 



(r m—r+j r m—r+j « 

max V] V] (aiB e n (At jt t) - ^/n (vr r(j) - vr^) |At^|) - max^ V] cr^^As^) > z ) 

(r m—r+j r m—r+j . 

max^ ^ M^(Af^) - max^ ^ M^(Ai^) > z J 

r m—r+j r m—r+j \ 

max ^ ^ A^(Ai^)-max^ ^ M '( At i^) > * ) ' ( 3 - 36 ) 
/r '™ i=i £=i ' ^ m ,=1 <=i / 

by the Invariance Principle and the Continuous Mapping Theorem. Next, for any < e < 1, let 

Ir,m(e) = {(t jjE ) G I r>m : ^ |At^|l { ^ <WT(r)} < er}. 

Thus, I* m = l r ,m(0) C / r ,m(e) C I rjm (l) = I rjm . We bound (j3.36p using this family of subsets as 
follows: 



z 



r m—r+j r m—r+j 

? ax E E ^(A^)-max^ ^ M e (At jtt ) > 
r m j=i i=\ r ' m j=i e=i 

r m—r+j r m—r+j 

< P max 



+ 



r m—r+j r m—r+j 

T ? r ax ,X E M^A^)-max^ £ M e (At j/ ) > 
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r m—r+j r m—r+j 

<P( max V E ^(A^)-max^ £ B l (A Sj/ ) > 

\ Ir.m (S) j = l £=1 l r , m j = l ^ 

r m—r+j r m—r+j 

+ P[ max Y ^( At i/)~ max E Y B e (A Sj/ ) > z + erc 

\ Ir,m\Ir,m{e) . = l ^™ i= l l= \ 

r m—r+j r m—r+j 



<P max V Y B e (A tj/ )-m a xY, Y ^( A ^) > 
\ 7r ' m(e) j=i £=i r ' m i=i <=i 

(r m— r+j r m—r+j . 

max E E ^(At^)-maxE E B e (A Sj>i ) > z + ercj. (3.37) 
/r ' m 7 =1 fcl 7 *.™ , = 1 £=1 / 



We can now take the limsup in (|3.37p . as c — > oo, and then, as e — > 0, and so establish convergence 
to zero in probability. Moreover, since 

r m—r+j r m—r+j 

max E Y M e (At j;i )-max^2 ^ M^(At^) > I = 1, 
Ir ' m j= i i=i " 7 *>™ j= i e=i ' ' 

we have in fact shown, with the help of (j3.36|) . that with probability one, 

r m—r+j r m—r+j 

maxj^ Y M\At j>e ) = ma^Y^ Y M ^( A ^)> 

J = l ^=1 ' J = l £=1 

and thus 



r m—r+j r m—r+j 

7 ax E Y -V"(^t0") _7r <0 l A M) ~™ ax E E a i^i( As i,i) — ^ °- ( 3 - 38 ) 

3=1 l=j ] = 1 £=j 



Since 



r m—r+j r m—r+j 

W^Y Y ^(A Sjif )^max^ E a eB e (A Sj/ ), (3.39) 

by the Converging Together Lemma, we have proved (|3.27p . Equation (|3.8p of the theorem follows 
from the bijection between I* m and I T -m r ,d r described in the general result (|3.30p . 

Finally, we can obtain the convergence of the joint distribution in (|3.1U|) in the following manner. 
Given any (9\, 02, ■ ■ ■ , 9 r ) E R r , we have 
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r , k m—k+j 

= E 6k ( ™ aX E E (^i^j/) ~ >/« {^r(j) ~ *t) \ At j,- 

k=i ^ fe,m j=i e=j 

k m—k+j \ r / ^ m— fc+J \ 

"™ ax E E ^(A^J+E^l^E E ^n(A^)). (3.40) 

Now from (13.38P , the first summation on the right-hand side of (13.40P converges to zero in probabil- 
ity, as n —7- oo. Moreover, the second summation is a continuous functional of (B^,B^, . . . ,B™), and 
so, by the Invariance Principle and Continuous Mapping Theorem, converges. Then the Converging 
Together Lemma, along with the bijection result ()3.30p . gives 

r /-irk \ r / k m ~ k +3 \ r 

E^ (^-p) => E^U ax E E ^b £ (a Sj A = 5>y*. (3.4i) 

fc=l ^ V 71 J k=1 V fc,m J=1 £=J - / fe=1 

Since (13.41 p holds for arbitrary [9\, 02, ■ ■ ■ , 6 r ) G by the Cramer- Wold Theorem, we have the 
joint convergence result (|3.10p . ■ 

Since the shape of the Young diagrams is more naturally expressed in terms of the R*, rather than 
of the y n fc , we may restate the results of the previous theorem as follows: 

Theorem 3.3 Let (X n ) n >Q be an irreducible, aperiodic, homogeneous Markov chain with finite state 
space A m = {a\ < • • • < a m }, and with stationary distribution (tti, 7T2, . . . , vr m ). Then, in the notations 
of TheoremWM 



R l n - 7wr r(1 ) R\ - nvr r(2) R% - wr T(m) 



(R 00 ,Rl ,...,R^), (3.42) 



where 

R 

and, for each 2 < k < m, 



di 

I = max^,r TW (lT®{t x4 ) - B^iti^)) , (3.43) 



i=m k _ 1 + \ 

k-m k (d k +m k -k+j) 

+ max ^ a r(m k +e)B T ^ rik+i \Atj ) e) 

*-k — mj t .,dfc 



3=1 £=j 

k-l-mk-i (d k _ 1 +m k _ 1 -k+l+j) 



max 



h-l-m k _ 1 ,d k _ 1 . = 1 f_ 



E E a T{mk _ 1+e) B^-^(At j4 ), (3.44) 
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where we use the notation B s (Atj/) = B s (tjj) — B 8 (tj £—%), for any 1 < s < m, 1 < j < k, and 1 < 
£ < m, and where the first sum on the right-hand side of (|3.44j) is understood to be 0, if = m^-i- 

Proof. First, R\ = V r ], and, for each 2 < k < m, = V% — V* -1 . Expressing these equalities at 
the multivariate level, we have 



R n ~ n7r r<l) R n 



nir. 



r(2) 



n7r. 



r(m) 



in yj n 



v m - V 



m—1 



nir. 



-r(m) 



n 



n 



n 



V* - nv\ V 2 - nv 2 



V™ - nv n 



n yn 
(V 1 V 2 V m ) - (0 V 1 



11 



m—1 



■n 



n 



x ■ • ■ • ) ^oo 



m— 1^ 



/pi r>2 ~R m \ 



(3.45) 



where the weak convergence follows immediately from the Continuous Mapping Theorem, since the 
transformation is linear. Equations (|3.43p and (|3.44p follow simply from the Brownian expressions for 
V*,...,V£)m Theorem^ ■ 

If all m letters have unique stationary probabilities, then we have the following corollary to Theo- 
rem [3T3l 

Corollary 3.3 If the stationary distribution of Theorem \3.3\ is such that each ir r is unique, then 



K ~ n7I "r(l) R n ~ n7r r(2) Rn ~ ^r(m) 



n \ n sin 



AT((0,0,...,0),E). (3.46) 



In other words, the limiting distribution is identical in law to the spectrum of the diagonal matrix 
D = diag{Z\, Z 2 , ■ ■ ■ , Z m }, where (Z\, Z 2 , ■ ■ ■ , Z m ) is a centered normal random vector with covariance 
matrix E. 



Proof. Now, for all 1 < k < m, df. = 1, and = k — 1, so that 



di 

Rl = max Vo- tW [B^(t 14 ) - B T W(t v _i)) = a T[1) B^\l), 
and, for each 2 < k < m, 



m k k-m k (d k +m k -k+j) 

Rk oo= E ^(1)+ max £ £ ^^B^+^At, 



=m fc _i+l fc ' fc 3=1 £=3 
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fc-l-m fc _i (d k _ 1 +m k _ 1 -k+l+j) 



— max 

I k -l-m k _ 1 ,d k _ 1 ^ 

fc-i i i 

= E ^ (i) ^ i )(l) + maxEE (7 T( fc -i+^ fc - 1+£ >(A^) 
— ji i — — 

i=k— l i=i 

i j 

- max ^ ^ a r(fc _ 2+£) £^- 2 +^ (At,-,,) 

= ^(^d^^U) + a rW ^( fc )(l) - * T{k _ 1) B r < k - 1 '>(l) 
= a T{k) B< k \l). 

Moreover, the joint law result for (i?^, i?^, . . . ,R™) holds as well, and this is clearly a multivariate 
normal distribution, with mean (0,0,..., 0) and covariance matrix S. Since the spectrum of a diagonal 
matrix consists of its diagonal elements, the final claim of the corollary holds. ■ 

Remark 3.5 The joint law of (i?^,,^^, . . . ,R^) in the iid uniform alphabet case is identical to the 
joint law of the eigenvalues of an m x m traceless GUE matrix. Corollaru \3.3\ also gives a spectral 
characterization for the unique probability case, in particular, for a non-uniform iid alphabet with 
unique stationary probabilities. This is consistent with the characterization of the limiting law of LI n 
in the non-uniform iid case, due to Its, Tracy, and Widom 120[ \21f . as that of the largest eigenvalue of 
the block associated with the most probable letters among a direct sum of independent GUE matrices 
whose dimensions correspond to the multiplicities d r of Theorem \3.2\ and \3.3l subject to the condition 
that Ylr=i ^/vr T ( r )X r = 0, where X\,X2, ■ ■ ■ ,X m are the diagonal elements of the random matrix. 

Remark 3.6 The difference between the zero-trace condition YlT=i -^r = and the generalized trace- 
less condition Y1T=\ \/^r(rJ-^r = amounts to nothing more than a difference in the choice of scaling 
for each row . We will find it more natural to express our results using the normalization associated 
with the zero-trace condition Y1T=1 = 



4 Fine Structure of the Brownian Functional 

So far, we have seen that the limiting shape of the RSK random Young diagrams generated by an 
aperiodic, irreducible, homogeneous Markov chain can be expressed as a Brownian functional. The 
form of this functional is similar to the iid case; the essential difference is in the covariance structure 
of the Brownian motion. We begin our study of the consequences of this difference. 

In the iid uniform m-alphabet case, Johansson [22] proved that the limiting shape of the Young 
diagrams had a joint law which is that of the spectrum of an m x m traceless GUE matrix. An 
immediate consequence of this result is that the limiting shape of the Young diagrams contains simple 
symmetries, e.g., for each 1 < r < m, = — R™ +1 ~~ r . Now, as was seen in Corollary 13. II of Theorem 
13.21 the form of the Brownian functional in the doubly stochastic case involved only the maximal term. 
We will see that that there is also a pleasing symmetry to the limiting shape of Young diagrams in the 
doubly stochastic case by examining a natural bijection between the parameter set I r ^ m and I m - r ,m-, 
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for any 1 < r < m — 1. Indeed, this result will follow as a corollary to the following, more general, 
theorem: 

Theorem 4.1 The limiting junctionals of Theorem \3.2\ enjoy the following symmetry property: for 
every 1 < r < m — 1, 

m r r~m r (m r +d r -r+j) 

C:=E^(^ TW ( 1 )+„ , maX E E ^ T{mr+t) B^ + "\M 3/ ) 
t=l H-,-)G7 r _ m ,., dr 

m m r +d r —r r—m r +j 

k £ a T(i) ^«(l)+ max £ £ a T(mr+ ^^)(A^), (4.1) 

i= m ,. +( f r + l «t-,Oeim P +dr-r,dr j = 1 £=j 

w/iere 5 £ (A) := - £^(s), /or A = and where the non-maximal terms on the left and 

right-hand sides of (|4,ip are identically zero if m r = 0, or m r + d r = m, respectively. 

Remark 4.1 Recall that, from the definitions of m r and d r , the non-maximal summation terms on 
the left and right-hand sides of (|4.ip reflect the letters which have, respectively, greater and smaller 
stationary probabilities than ir T i r y Recall, moreover, that the maximal terms are associated with the 
indices having the same stationary probability as ir T r r \. The maximal term on the left-hand side of 
(|4.ip involves a summation over r — m r rows, while the one on the right-hand side involves m r+ i — r 
rows. Thus, in a sense, the two maximal terms in (|4.ip split d r = m r+ i — m r rows between themselves. 
In summary, the functional on the right-hand side of (|4.ip corresponds to the sum of the lengths of 
the m — r bottom rows of the Young diagrams. 

Proof. Without loss of generality, we may assume that r(j) = j, for all 1 < j < m. Fix 1 < r < m— 1, 
and for any point t in the index set I r -m r ,dri define Atj +rrir ^ = [tj^i^tj^], for 1 < j < r — m r and 
1 < £ < d r . Furthermore, for each 1 < j < m r or m r+ \ < j < m, set Atji = [0, 1], for j = £, Atjt = 
{0}, for < £ < j, and Atjj = {1}, for j < £ < m. Next, as in the proof of Theorem 13.21 consider the 
set of points {tj,e} {i<j< r ~m T ,i<i<d T )i an d order them as sq := < s\ < ■ ■ • < s K _i < s K := 1, for some 
integer k, and let As q = [s q -i, s q ], for each 1 < q < k. 

Now, for each 1 < q < k, let A q consist of the indices I for which As q n Ai,- ^ 7^ 0. Then, almost 
surely, 

m r r—m r (m r +d r —r+j) r m 

£ a t &0) + E E <r mr+i B^(At j>e ) = £ £ *i&{At i4 ) 
i=i j=i e=j j=i 1=1 



rum 



=EEE^( A ^ nAs «) 

j=l g=l l=\ 
r k 

= EEE^( As «)- ( 4 - 2 ) 

3=1 q=l KzA q 

Now by the "stairstep" properties of I r>m there are precisely r elements in each A q . Letting 
A q = {1, . . . , m} \ A q , for each 1 < q < k, we thus see that each A q contains exactly m — r elements. 
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Let £j : g be the j th smallest element of A q . We claim that for each 1 < j < m — r, the sequence 
tj,\i &3,ii ■ ■ ■ ->^j,K- is weakly decreasing. 

Indeed, fix 1 < j < m — r and 1 < q < n — 1, and suppose that £j >q is less than all the elements of 
A q . Then, by the properties of I r , m , the least element of A q+ \ is no smaller, so that the j smallest 
element of A q , £j :q +i is also £j jQ . Next, suppose that £j >q is greater than k > 1 elements of A q . Thus, 
£j t q = j + k. Then there are at most k elements of A q+ i which are less than or equal to £j q , by the 
properties of I r , m - But this implies that there are at least j elements of A q+ i which are less than or 
equal to £j tq . Thus, £j, q +i < £j, q , and the claim is proved. 

Moreover, since each A q contains {1,2,... , m r }, we see that necessarily each A q contains {m r + 
d r + 1, m r + d r + 2, . . . , m}. 

For each 1 < j < m — r, we may now amalgamate the intervals As q to obtain a partition of the 
unit interval. Specifically, for each 1 < j < m — r, and each 1 < £ < m, let v,j£ be the smallest s q such 
that £j jq +i < £■ (We define n Ji0 = 1, for all 1 < j < m — r.) 

Finally, and most crucially, recall that Y^T=i a £^(^) = 0) f° r all Then since (B l , B 2 , . . . , B m ) = 
(-B 1 , —B 2 , . . . , —B m ), 



EE E = E E E {-^(A Sq)/ 

j=l q=l £&A q j=l q=l i€ A q 

m m r +d r —r in 

= - Yl E E CT «v+^ mr+ '( A ^) 

i=m r +d r +l 3=1 1=1 

m m r +d r —r m 

^ ^#(1)+ Yl ^2* mr+ iB mr+e (&u h£ ), (4.3) 

i=m r +d r +l j=l i=l 

where A-u.^ = \uj^x,UjJ\. But, by the way we ordered each A q , we must have A.Uj 1: ( n Aii^ = 0, 
for any ji ^ ji- Thus, u £ I mr +d r -r,d r , and so we may restrict the summation over £ in f|4. 3[) to 
t = j, . . . ,r — m r + j, since the remaining terms are zero. Equation (|4.ip follows immediately by 
taking the maxima over I r -m r ,d r and I mr +d r -r,d r over the left-hand and right-hand sides, respectively, 
of (S3D. ■ 



For doubly stochastic transition matrices, the symmetry is even more apparent: 

Corollary 4.1 Let the transition matrix P of Theorem \3.S\ be doubly stochastic. Then, for every 
1 < r < m — 1, 

r m—r+j 

V&:= max £ £ a, - B%^{)) 

m—r r+j 

- , E E ^ (^(M - ^K^-i)) : = *sr r > ( 4 - 4 ) 

3=1 

and so 
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Ej=i R n ~ m/m ^ rn/m - £™ m _ r+1 Ri 



lim ■— _ ± Km _ ^j->"-^ z . (4.5) 



Moreover, 

(V^...,V^±(V™-\...,V™- r ). (4.6) 

Proof. Since m r = and d r = m for all 1 < r < m, the non-maximal terms on both sides of (|4. 1[) 
disappear, and we have (|4.4j) . 

To prove (|4.5|) . recall that V^™ = EJLi -Rn = Then, from the result just proved, 

V™~ r - ( to - r)n/m _ EgT ^ - (m - r)n/m 
n \fn 



( n ~ J2T= m -r+i R n) - (m - r)n/m 



n 

■3 



rn/m - £ i=m _ r+1 i% 



=C (4.7) 



and we have established the claimed symmetry. 

Finally, the extension of (|4.4p to (|4.6p follows from a standard Cramer- Wold argument. 



Remark 4.2 Since R™ = — V™~^, almost surely, Corollary \4-l\ states that R™ = —R^. From the 
symmetry of the Brownian motion, we thus see that R™ may be represented as a minimal Brownian 
functional: 



nil 



mm 



e-i, 



i=i 



a result also noted in 



Turning again to the cyclic case, recall that, for to > 4, the limiting shape of the Young diagrams in 
general differs from that of the iid uniform case. The following proposition characterizes the asymptotic 
covariance matrices of such Markov chains. 

Proposition 4.1 Let P be the to x to transition matrix of an aperiodic, irreducible, cyclic Markov 
chain on an m-letter, ordered alphabet, A m = {«i < «2 < • • • < ct m }, with 



( ai 

a-2 



P 



«3 



0,1 



«2 \ 

a 3 



a\ a m 
a 2 a% J 



(4.8) 
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For 1 < j < m, let Xj = Y1T=1 afc^* -1 ^' -1 ) be an eigenvalue of P, where uj = exp(2iri / m) is the 
m th principal root of unity. Let also 7j = Aj/(1 — Xj), for 2 < j < m, and (3j = cos(2irj /m), for 
< j < m. Then, the asymptotic covariance matrix £ is given by: 

For m = 2mo + 1, 



. mo+l 



(4.9) 



J'=2 



and for m = 2m,Q, 



1 A m ° 9 



J=2 



(4.10) 



where is an m x m Toeplitz matrix with entries (M^)^ = j3^j_i^_^, /or 2 < j < m, and 

(M«) M = 5 k/ - (1 - <y M )/(m - 1), for j = 1. 

Proof. It is straightforward, and classical, to verify that, for each 1 < j < m, (1, u> J ' -1 , w 2 ^' -1 ), . . . , o;( Tra_1 )(j , ~ 1 )) 
is a left eigenvector of P, with eigenvalue A,,- = J^feLi OfcW^ -1 ^' -1 ). We can thus write our standard 
Jordan decomposition of P, which in this case is a true diagonalization, as P = S~ 1 AS, where 
A = diag(l,A 2 ,...,A m ), 



/I 
1 



S = 



UJ 
2 



UJ 



UJ 
UJ 4 



1 \ 

u m-l 
w 2(m-l) 



\1 W™" 1 U! 2(m -^ 



(4.11) 



(m-l) 2 y 



and 



s~ 1 = - 



/l 
1 
1 



m 



UJ 

UJ~ 



UJ 

UJ' 



1 \ 



-2(m-l) 



\1 oH™- 1 ) w-^™- 1 ) 



(4.12) 



a; 



-(m-l) 2 J 



In the present cyclic, and hence, doubly stochastic case, we know that £ = (l/m)(I + S 1 1 D5 + 
(S' 1 DS) T ), where, as usual, D = diag(7i,7 2 , . . .,7m) = diag(-l/2, A 2 /(l - A 2 ), . . . , A m /(1 - A m )). 
We can then compute the entries of S~ 1 DS as follows: 



(S 1 DS) juj2 = ^2(S 1 )j 1 ,k{D)k,i{S)ij 2 

k,e 

= E 1 ( w "°' 1 " 1)( *" 1) )(^)(^" 1)( '" 1) ) 
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m 

T* w (J2-ji)(*-i) 



£ 

fc=i 

^-; + ew« m ], (4.i3) 



-04 

k=l m 

1 / 1 ; " 



fc=2 



for all 1 < 3\ij2i < m - The entries of the asymptotic covariance matrix £ can thus be written as 



(7 



51 ,32 



1 

m 



(<S jlJa + (S- l DS) n , n + (S-^S)^) 

\ \ k=2 

2 m 



m - 1 . (i) 2 



(4.14) 



fc=2 



for all 1 < ji, j2, < m. 

Next, note that since X m+ 2-k = i.e., the complex conjugate of X m+ 2-k, we have 7 m +2-A; = %, 
for all 2 < k < m. Moreover, since /%2-ji|(fc--i) = /%2-jil(("H-2-fc)-l)> we can write (|4. 14|> more 
symmetrically as (14. 9h or (I4.10p . depending on whether m is odd or even, respectively, and in the 
latter case, we also use that 7 mo +i is real, since oj mo = — 1. ■ 



Let us again examine the cases m = 3 and m = 4. In the former case, we have 

M« = 



1 -1/2 -1/2 N 
■1/2 1 -1/2 
4/2 -1/2 1 



(4.15) 



But for m = 3, fii = -1/2 = /3 2 , and so MW = Hence 

S = + ~i?e( 72 )M^ = ^(l + 2Re(j 2 ))M ( - 1 \ 

Hence, for m = 3, cyclicity always produces a rescaled version of the uniform iid case, with the 
rescaling factor given by 1 + 2Re[^ 2 ). 
For m = 4, however, 



and /3i = 0, j3 2 





/ 1 




-1/3 


-1/3 


l/3\ 




-1/3 


1 


-1/3 


1/3 




-1/3 - 


-1/3 


1 


1/3 




V-l/3 - 


-1/3 


-1/3 


1 / 


= 0. Thus, 



















-1 0\ 




M< 2 ) = 





1 


-1 








-1 





1 










-1 


I J 
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and 



In this case, we have 



( 1 - 1 


1 


-1 1 


-1 


1 -1 


1 


V-i i 


-1 



JL M (i) + l^^^M^ + ^ 73 M( 3 ). 
lb lb lb 



Next, note that 2M^ + = 3M (1 \ Then, if Re(-y 2 



73, 



E 



^ M (l) + 4 iJe( )Af (2 ) + 2 73M(3 ) 
lb lb lb 

3 M (l) + 2 (2ife( )M(1)) 
lb lb 



lb 



;i + 2fe( 72 ))M( 1 ), 



(4.1b) 



so that there is still a rescaled version of the iid case in a non-iid cyclic setting. Indeed, since we know 
that A2 = a\ + ia 2 — a 3 — ia^ = {a\ — a 3 ) + 2(02 — 04) and A3 = a\ — a 2 + a 3 — 04, we find that 



-Re (72) 



1 — a>2 — 2a3 — 04 



1. 



73 if and only if a| = 02^4. 



(a 2 + 2a 3 + a 4 ) 2 + (a 2 - (I4) 2 

and 73 = l/(2(a 2 + 04)) — 1. A short calculation then shows that i?e(7 2 ) 
We thus have a complete characterization of all 4- letter, cyclic Markov chains whose Young diagrams 
have the same limiting shape as the uniform iid case. In particular, choosing 02 = 04 = a, for some 
< a < 1/3, leads to 03 = a and a\ = 1 — 3a. If, moreover, a = 1/4, we have again the iid uniform 
case. For a ^ 1 /4, however, we may view the Markov chain as a "lazy" version of the uniform iid case. 

Note that the scaling factor in both (|4.15j) and (|4. 16[) is 1 + 2Re{^2)- The following theorem shows 
that, in fact, such a scaling factor occurs for general m, and gives a spectral characterization of cyclic 
transition matrices which lead to an iid limiting shape. 

Theorem 4.2 Let P be the mx m transition matrix of an aperiodic, irreducible, cyclic Markov chain 
on an m-letter, ordered alphabet given in Proposition \4-l\ Then the asymptotic covariance matrix £ 
is a rescaled version of the iid uniform covariance matrix Sj^ n := ((m — l)/m 2 )M^ if and only if 



Re 



A; 



1 - A, 



for all 2 < j < m, 



for some real constant 7. Moreover, the scaling is then given by 

E = (1 + 27)E iidn . 



(4.17) 



(4.18) 
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Proof. We first claim that the system of matrix equations 



3=2 



(4.19) 



has a unique solution bj = l/(m — 1), for all 2 < j < m. Indeed, revisiting (|4.14p . we can express each 
MW as 



where (M^) k/ = wU-W-Q /2, for all 1 < k,£ < m, so that (127191) becomes 



(4.20) 



i=2 



3=2 J=2 



(4.21) 



where bj := (bj + 6 m „ J+ i)/2, for 2 < j < m. 



Now, clearly, each M^) [ s cyclic, so that in solving ()4.21j) we need only examine the m entries in 
the first rows of the matrices. We can thus reduce (|4.21|) to the m x (m — 1) system of equations 



/ 1 


1 


1 










uj 2 


a; 3 








OJ 2 




CO 6 


\u m - 1 


u 2(m-l) 


w 3(m- 



00 



1 \ 

w m-l 
2(m-l) 



(b 2 \ 

h 

\bmj 



I 1 \ 

-1 

m— 1 
-1 

m— 1 



\m-l/ 



(4.22) 



Since each of the last m — 1 rows of the matrix in (I4.22P sums to —1, it is clear that bj = l/(m — 1) is 
a solution to the system. To see that this solution is, in fact, unique, consider the (m — 1) x (m — 1) 
sub-matrix consisting of the last m — 1 rows of the matrix in (14.22H , namely, 



00 
2 



00 



00 

oo A 



00 

oo G 



w 2(m-l) 



m— 1 , ,2(m— 1) , ,3(m— 1) 



(m-l) : 



(4.23) 



Now this matrix, which is very closely related to the Fourier matrix which arises in discrete Fourier 
transform problems, is in fact invertible, and can be shown to have one eigenvalue of —1, and m — 2 
eigenvalues of the form ±y/m and ±i^/m, so that the modulus of the determinant is rrS m ~ 2 ^ 2 ^ 0. 
Thus, the solution bj = l/(m — 1) is unique, and since bj = (bj + 6 m _ J+ i)/2 = b m -j + i, for all 
2 < j < m, we conclude that bj = l/(m — 1) as well, for all 2 < j < m, and the claim is proved. 

We can now use Proposition 14.11 to simplify the asymptotic covariance matrix decomposition as 
follows: 



1 o 

k=2 
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m 



m — 



k=2 



l M ( 1 )+2 7 ^-M( 2 ) 



(1 + 2 7 ) 



m 



in- 



(1 + 27) S ii(iu , 



(4.24) 



where 7 = Re(jj), for all 2 < j < m. If the real parts of 7^ are not all identical, then the uniqueness 
of the solution of (|4.19p implies that no such simplification is possible, and the theorem is proved. ■ 



Remark 4.3 To see that the condition in (|4.17p is not vacuous for any m, recall that for m = 4 ; the 
"lazy" chain has the iid limiting shape. This is true for general m: if a2 = 03 = • • • = a m = a, for some 
< a < l/(m— 1), then Xj = 1— (m— l)a, for all2 < j < m. Trivially, then, 7™ = l/((m— l)a) — 1 := 7, 
/or oZZ 2 < j < m, so that the conditions of Theorem \4-%\ are satisfied, and the scaling factor is given 
by 1 + 27 = (2 — (m — l)a)/((m — l)a). -Even in i/te m = 4 case, however, we saw that there were 
other, more general, cyclic transition matrices which gave rise to the iid limiting distribution. 

The previous proposition indicates precisely when we may expect the limiting shape of a cyclic 
Markov chain to be spectrum of the traceless GUE. Now the first-order behavior of all rows of the 
Young diagrams is n/m + 0(y/n) for cyclic Markov chains. Although this differs from the first-order 
behavior in the non-uniform iid case, one may still ask whether the limiting shape for some cyclic 
Markov chains might still be that of some non-uniform iid case. In fact, this can never occur: cyclicity 
ensures that the asymptotic covariance matrix is also cyclic, and thus cannot be equal to the asymp- 
totic covariance matrix of any non-uniform iid case. 

Another class of Markov chains whose asymptotic covariance matrices can be easily studied is the 
class of reversible Markov chains, i.e., those with transition matrices such that vrjPjj = njPjj, for 
all 1 < i, j < m. The following theorem describes the asymptotic covariance matrix of such Markov 
chains and, in the doubly stochastic case, gives necessary and sufficient conditions for recovering a 
rescaled uniform iid asymptotic covariance matrix. 

Theorem 4.3 Let P be the transition matrix of an aperiodic, irreducible, reversible Markov chain on 
an m-letter, ordered alphabet A m = {&i < 0.1 < ■ ■ ■ < a m }. Then the asymptotic covariance matrix E 
is given by 

H = Ii 1 / 2 S T {I + 2D)SH l/2 , (4.25) 

where P = (5n 1 / 2 ) _1 A(5II 1 / 2 ) is the diagonalization of P. If, moreover, P is doubly stochastic, i.e., 
if P is symmetric, then S is a rescaled version of the iid uniform case with transition matrix P u ad if 
and only if P = aP u ud + (1 — a)l , for < a < m/(m — 1) . 

Proof. It is elementary that P is similar to a symmetric matrix Q. Specifically, P = Ii^ l / 2 QH 1 / 2 , 
where II is the diagonal matrix containing the stationary distribution. Since Q is symmetric, it is 
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diagonalizable with orthogonal eigenvectors. Writing Q = S T AS, we have P = (STI 1 / 2 ) 1 A(SH 1 ^ 2 ). 
But then 

e = n + n(5n 1 / 2 )- 1 L>(5n 1 / 2 ) + (su^^Disu^fu 
= n + 2n 1/2 5 T z?5n 1/2 
= n 1/2 (/ + 25 T L>5)n 1/2 

= H 1 / 2 S T {I + 2D)SIi 1 / 2 . (4.26) 
where D = diag(-l/2, A 2 /(l - A 2 ), • • • , A m /(1 - A m )). Writing A E := (J + 2D), we find that 

1 + A 2 l + K 



A T , = diag 



1_ A Ar 



In the doubly stochastic case, (I4.26D constitutes the diagonalization of E, since II = (l/m)I. 
If X = aTiuiid, for some a > 0, then the spectrum of E must consist of and m — 1 identical 
positive eigenvalues. Therefore, A 2 = A3 = • • • = A m = A, so that A = diag(l, A, A, . . . , A). Thus 
A s = (1 + A) (J - A)/(l - A) 2 , so that (jl^Bl now becomes 



S = ~S T AxS = *±*ST(J - A)5 = //^ (I - P). ( 1.27) 



In particular, T, uii( i = (l/m)(I — P u ud)- Since E = aH,^, for some a > 0, it follows that 

(1-A) 2 / (1-A) 2 \ , s 

i 3 = a V 1 + A ; P U iid+ U-a \ + A (4-28) 

After setting a = a(l — A) 2 /(l + A), and checking that P is irreducible for < a < m/(m — 1), the 
theorem is proved. ■ 

Remark 4.4 Recall that in the A-letter cyclic case, where P had initial column entries 01,02,03, and 
04, the criterion for obtaining a rescaling of the covariance matrix in the iid case was that a 2 = 020,4. If 
we further demand that P be symmetric, then a 2 = 04, and the criterion if refined to a 2 = 03 = 04 = a, 
which is consistent with Theorem\4-3\ as P = 4aP u nd + (1 — 4a) I, for < 4a < 4/3. 



We now seek to relate iid non-uniform limiting shapes (which are spectra of direct sums of GUEs) 
to those of general Markov chains having the same stationary distribution. The following interpolation 
result describes the asymptotic covariance matrix for a one-parameter convex class of Markov chains, 
the covariance matrix being given as a linear combination of the "base" covariance matrix and that 
of a closely-related iid case. 

Proposition 4.2 For any m > 3, let Po be the m x m transition matrix of an irreducible, aperiodic, 
Markov chain, and let its associated asymptotic covariance matrix be given by 

E = n + UoiS^DoSo) + (S^DoSofllo, (4.29) 
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in the standard notations of Theorem \2.1\ Then, for < 8 < 1, i/ie transition matrix P = (1 — 5)J m + 
<5Po arj asymptotic covariance matrix given by 



S = i(S + (l-<5)Sn ), 



(4.30) 



where Sn is the covariance matrix associated with the iid Markov chain having the same stationary 
distribution as Pq. 

Proof. Using the standard notations of Theorem I2.lt we will write 

E = n + UiS^DS) + (S^DSfU 



in terms of the decomposition So in (|4.29p . Now, clearly, the stationary distribution under P is that 
of Pq, so that II = n . We will thus write the stationary distribution simply as (ni, 7r 2 , . . . , ir m ). 
Moreover, the eigenvectors are also unchanged, so that S = So. However, for each eigenvalue A^o of 
Pq, we have that = (1 — 8) + 8\)-o is an eigenvalue of P, for 1 < k < m. Thus, for each 2 < k < m, 
the diagonal entries of D are given by 



Ik ■-- 



(1-5) + 5X k , 1 - 5 



+ 7fc,o> 



1 - X k 5(1 - A fc)0 ) 5 
where 7&o are the diagonal entries of Dq. We can thus decompose D as follows: 



D = diag(-l/2,72, ... ,7m) 
= diag 



1 /2. 0) + ( i-^ J diag(0, !,...,!)+ Q ) rliagdJ. >>,,. . 



,7ro,oJ 



diag 



1-A A /1-5 

).o U 



/,„ ( l - ) A,. 



(4.31) 



Next, recall from Proposition 12.11 that the first column of S 1 is (1, 1, ... , 1) T ■ Hence, 



S~ X DS = Sq 1 DS q 
(l * ■ 



\1 * ••• */ \ 



7Ti 7T2 



) V 



* 5*0 + ( 7 ) #0 1 DqSq 



1-5 
25 



/iT 1 7T 2 • • • TT m \ 
7Ti 7T 2 • • • 7T„ 



\7Ti 7T 2 • • • 7T m / 



+ 



1 — 8 



^ m + ( ^ ) ^0 A)<Sb> 



(4.32) 
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which gives us 



us^ds = UqS^ds 



1-5 



/tti 



25 



0\ / 



o 7r 2 •■• ; 

O : '■• : 

V TT m J 



7Tl 7T2 
7Tl 7T2 



+ ( n + ( \ ) UoS^DqSq 



5 

( *? 



2(5 



7T27T1 



VT17T2 
7T 2 



\7T m 7ri 7r m 7T2 



7Tl7T m \ 



+ ( i— ^ Il„ ' (^) ' ' 



7T m 



\7Tl 7T 2 • • • 7T m / 



(4.33) 



Finally, we can express £ as 



s = n + n(5^ 1 z?5) + (s^DSfn 

±) n + (i - n + u (s~ 1 ds) + (Uois^DS)) 1 



= []) ^o+(i-]) (n -2n ) 

/ 7T 2 7T17T2 
7T27T1 7r| 



\7T m 7ri 7T m 7r 2 



lU+fi-lp-E,,., 



vr 2 7r m 



= -(S + (l-<5)S no ), 



(4.34) 



and we are done. ■ 

Thus far we have expressed our limiting laws in terms of Brownian functionals whose Brownian 
motions have a non-trivial covariance structure arising directly from the specific nature of the transition 
matrix. It is of interest to instead express the limiting laws in terms of standard Brownian motions. 

Since the asymptotic covariance matrix £ is non-negative definite, we can find an m x m matrix 
C such that £ = CC T . Clearly, we then have 
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(a 1 B 1 (t),a 2 B 2 (t), . . . , a m B m {t)) T = C{B x {t), B 2 (t),..., B m (t)f, (4.35) 

where (B l (t) , B 2 (t) , . . . , B m (t)) T is a standard, m-dimensional Brownian motion. 

In order to simplify notation, we will assume that t(£) = £, for all £, and so write our main result 
(13.81) in Theorem I3J2I as 



T/r m r r—mr (d r +m r —r+j) 

Vn-vm ^j^ ak&{1)+ ^ £ CTmr+£J B m ^(A^) := t£. (4.36) 

V n k=1 ir- mr ,dr j = l £=j 

Simply substituting ()4.35p into (|4.36p immediately yields 

m r I m \ r—m r (dr+mr—r+j) / m \ 

c = E E^-^ 1 ) +, max E E E c ^+^( At ^) 

fc=l W / 'r-mr,^ j = 1 £=j \ {=1 J 

m m r m r-m r (d r +m r -r+j) 

= EE c v E?i (i) + r max E E E cv-kMa^). (4.37) 

i = l k=l Jr-m r ,d r . = 1 ^ £=j 

Now the first term in (|4.37p is simply a Gaussian term whose variance can be computed explicitly. 
Unfortunately, the maximal term does not in general succumb to any significant simplifications. How- 
ever, in the iid case, we can further simplify (|4.37p in a very satisfying way. Indeed, since, in the iid 
case, we have a? = 7Tfc(l — ir^) and, for k ^ £, a^/ = —T^k^e, one can quickly check that C can be chosen 
so that Ck,k = ~ \p^k^k^ and, for k ^ £, C^/ = —y/WeTTk- Moreover, for all m r + 1 < k < m r + d r , 
7Tfc = 7r mr+ i = 7r r . Then, within the maximal term, C mr+ £^ = ^JiL r — Tr r ^Ti\., for i = m r + £, and 
Cm r +i,i = —Kry/Tti.-, for i ^ m r + 1. With the convention that uq = 0, we can then express (14.371) as 

vl = £ v^b\i) + £ Y,{-^ k )B\i) 

i=l i=l k=l 

f-r—rriT (dr+mr—r+j) m r—m T (dr+m r —r+j) ^ 

+ max \J2 Yl ^(A^+EE E (-^V^W^) 

*r— m r ,d r • 1 ■ -i /? ■ J 

3 = 1 1=3 i=l 3=1 t=3 

m r m m r 

= ^^ l (i)-E^ l d)E^ 

i=l i=l k=l 

■ r—mr (dr+mr—r+j) m r—mr (dr+rrir—r+j) 

+ ~ 



~ r max \ Yl E ^ mr+ '(A^)- v^E^ E E B \^)\ 

Ir- mr , dr I j = 1 £=j i=1 j = 1 e=j J 

f m r m m 

| V^B\l) - y mr y/n&{l) - n r (r - m r ) ]T V^^(l) } 

r-m r (d r +m r -r+j) 

+ max ^ E B mr+i (At jt i) 
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, m r m n r-m r (d r +m r -r+j) 

= max E £ mr+ '(A^) 

I ■ -1 -i J lr — m r ,dr -t t, 

1 = 1 1 = 1 J = l £=J 

m r m 

= {(i-v r )Y^F i B i (i)-v r Yl Vn&ii)} 

i=l i=m r +d r + l 

m r +d r r—m T (dr+m r — r+j) 

+ y/T r {-v r ^(1)+ max Y E B m ^(At j>e )\. (4.38) 

i=m r + 1 3 = 1 £=j 

Note that the first two Gaussian term of (|4.38p are independent of the remaining two Gaussian- 
maximal expression terms. 

Following Glynn and Whitt[T3"] and Barishnykov[4j, who studied the Brownian functional 

m 

D m = max^^(A^), 

h ' n l=i 

we define the following, more general, Brownian functional: 

r (m—r+j) 

Z) rim :=max^ ^ B^At^), ( 4 - 39 ) 

"' m 3=1 t=j 

where 1 < r < m. Clearly, the maximal term in (|4.38p has just such a form. We also remark that 
D r ,m corresponds to the sum of the r largest eigenvalues of an m x m GUE matrix. 

To better understand (I4,38p . we may, without much loss in generality, focus on the first block, that 
is, values of r such that m r = 0. The first Gaussian term of (|4.38p thus vanishes, and, writing ir max 
for 7r r , we have 

m / di \ 

Y,B\l) + D rA . (4.40) 
i=di+l V i=l J 

In the uniform iid case, the first Gaussian term of (|4,40p itself vanishes, since d r = d\ = m, and we 
have 



yr 



-7= f"-E^(l)+0 :=^- (4-41) 



Furthermore, and still specializing (|4.40p to r = 1, 



-TTmaz E V^-^W + V^max f -^max E ^ Q-) + Dl,d L 
j=a!i+l \ i=l , 

m / ^ \ d 



m / 1 \ 

max I , T^max Y B\l) + y/^H 1>dl . (4.42) 

i=<2i+l ^ 1 ' i=l 



39 



One can easily compute the variance of the Gaussian terms in (j4.42j) to be TT max (l/di — ir max ). For 
di = 1, this becomes TT max (l — Tr max ), and since H\ \ = a.s., the limiting distribution is iV(0, 7r max (l — 
f maj .)). For d\ = m, the variance of the Gaussian term vanishes, and we again recover (|4.4ip . namely, 
Hi >m /y/rn. Both of these results are consistent with those of the authors' previous paper [17] . 

5 Connections to Random Matrix Theory 

For iid uniform m- letter alphabets, the limiting law of the shape of the Young diagrams corresponds 
to the joint distribution of the eigenvalues of an m x m matrix from the traceless GUE [22]. In the 
non-uniform iid case, we further noted that Its, Tracy, and Widom [20[ [2T] have described the limiting 
law of the length of the longest increasing subsequence as that of the maximal eigenvalue of a random 
matrix consisting of independent diagonal blocks, each of which is a matrix from the GUE. The size 
of each block depends upon the multiplicity of the corresponding stationary probability. In addition, 
there is a zero-trace condition involving the stationary probabilities on the composite matrix (see [19] 
for the RSK diagrams result). 

As a first step in extending these connections between Brownian functionals and spectra of random 
matrices, recall the general case when the stationary probabilities are all distinct (see Remark l3.5p . Our 
Brownian functionals then have no true maximal terms, so that the limiting shape, (i?^, i?^, . . . , R^) 
is simply multivariate normal, with covariance matrix £ (or, more precisely, the matrix obtained by 
permuting the rows and columns of £ using r, the permutation of {1,2, . . . , m} previously defined). 
Trivially, this limiting law corresponds to the spectrum of a diagonal matrix whose elements are 
multivariate normal with the same covariance matrix £. 

We can see that this general result is consistent with the iid non-uniform case having distinct 
probabilities. Indeed, each block is of size 1, and is rescaled so that the variance is tc t (s\(1 — tt t ({\), 
for 1 < i < m. Because of this rescaling, instead of having a generalized zero-trace condition, as in 
the non-rescaled matrices used in [201 121j . our condition is rather a true zero-trace condition. This 
zero-trace condition is clear, since the covariance matrix for any iid case (uniform and non-uniform 
alike) is that of a multinomial distribution with parameters (n = 1; ir T m, 7r T (2)> • • • , i^ T {m))i an d any 
(Y%, Y%, . . . , Y m ) having such a distribution of course satisfies YliLi ^ = 1> so that Var(^™ x Yi) = 0, 
which implies the zero-trace condition for (R^, R^, , . . . , R^)- 

Next, consider the case when each stationary probability has multiplicity no greater than 2. One 
may conjecture that (R^, R^, . . . , R™) is equal in law to the spectrum of a direct sum of certain 
2x2 and/or lxl random matrices. Specifically, let k < m be the number of distinct probabilities 
among the stationary distributions. Then the composite matrix consists of a direct sum of n GUE 
matrices which are as follows. First, the overall diagonal (Xi, X2, • • • , X m ) of the matrix has a N(0, E) 
distribution. Next, if d r = 1, then the GUE matrix is simply the lxl matrix (X r ). Finally, if d r = 2, 
then the GUE matrix is the 2x2 matrix 



whose off-diagonal random variables Y mr+ i and Z mr+ \ are iid, centered, normal random variables, 
independent of all other random variables in the overall matrix, with variance 





+1 - 2Pmr+l,m r +2Vrn r +l<7m r +2 + C r m ,.+2)/ 4 - 
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Such a conjecture would imply the following marginal result regarding a single block of such a matrix, 
which without loss of generality we take to be the first block. In fact, this result is a genuine extension 
of the connection between maximal Brownian functionals and random matrices beyond the standard 
GUE, traceless GUE, and purely diagonal cases already discussed. 

Theorem 5.1 Let d x = 2 and r(r) = r, for all 1 < r < m. Then (R^R 2 ^) = (V£,V£ - V*,) is 
distributed as the spectrum {(Ai, A2) : Ai > A2} of the 2x2 Gaussian Hermitian matrix 

where (X\, X2) is a pair of centered, bivariate normal random variables with covariance matrix 

E 2 =(J* rt), (5.2) 

and Y\ and Z\ are iid, centered normal random variables, independent of (X±,X 2 ), with variance 

{a\ - 2pd x a 2 + al)/4. (5.3) 

Proof. We prove the equivalent result that (V^, V^) is distributed as the pair (Ai, Ai + A2). Now, by 
definition, 



(V^Vl) = [^ i {aiB\t) + a 2 B\l)-a 2 B\t)),a 1 B\l) + a 1 B\l) j 

'd 2 B 2 (l) + max (^B^t) - a 2 B 2 (t)) , cr^l) + a x B 2 {l)} . (5.4) 



We now simplify (j5.4|) . by introducing new Brownian motions and then decomposing the resulting 
expression into two independent parts. To do so, begin by defining the new variances and correlation 
coefficients o~\ := a\ := a\ — 2pa x a 2 + o\, and p := (p~a x — a 2 )/ ' yj af — 2pd x a 2 + a\. Then it is 
easily verified that B x (t) := B 2 (t), and B 2 (t) := {a x B l {t) — a 2 B 2 (t))/a 2 , where the equalities are 
pointwise, are (dependent) standard Brownian motions, and (|5.4|) becomes 

{VL Vl) = {oiB\l) + <J 2 max B 2 {t), 2a 1 B 1 (l) + a 2 B 2 (l)) 

(o x B\l) - pa x B 2 {l)) + a 2 (p^B 2 (l) + max B 2 (t)), 

\ G 2 0<i<l / 

2{a x B x {\) - po x B 2 {\)) + (a 2 + 2pa 1 )B 2 (l)) . (5.5) 



Note that B l {t) — pB 2 (t) is independent of B 2 (t) and has variance o~\{l — p 2 )t. Introducing the 
Brownian functional 

U(0) = (p- B 2 (l) + max B 2 (t), (5.6) 
j3 € R, and using erf, a?., and p above, (|5.5p becomes 
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(VL Vl) = o^\- ?Z(1, 2) + (a 2 U Q - p^) , (<7 2 + 2pa 1 )B 2 (l)^ 



\Jo\- 2pd\d 2 + a. 



T 



+ [sj~al-2p~a 1 a 2 + al U( = 2 ) , 2(fff - a 2 )ff 2 (l) ) , (5.7) 



where Z is a standard normal random variable independent of the sigma- field generated by B 2 . 

Turning now to the eigenvalues' distributions, we first consider the centered, multivariate normal 
random variables (WijWz), having covariance matrix 

/ o\ pa x a 2 \ 

\P0X02 of J ' 

and let W3 and W4 be two iid, centered, normal random variables, independent of (W\,W%), with 
variance a\. Then it is classical that 



(w 2 ,^W 2 + W 2 + W 2 ) £ a 2 {B(l), 2 max B(t) - B(l)), 

or, equivalently, 



W 2 ,(3W 2 + ^W 2 + W 2 + wfj =a 2 (B(l),U((3)), 



(5.8) 



where B is a standard Brownian motion, and U((3), (3 £ M, is defined in terms of B, rather than in 
terms of B 2 , as in (15.61). Then consider the random variable 



A := Wi + a/W 2 + W£ + W£ 
= (Wi ~ P^j + (p^ + sjwl + W 2 + W*\ . (5.9) 
Using (|5.8p . and noting that the variance of the first term in (|5.9p is <r 2 (l — /? 2 ), it is easy to see that 

A £ tri + 2 ^ U {^) > ( 5 - 10 ) 

where Z is a standard normal random variable independent of i?. 

We now apply this result to the eigenvalues of the matrix M in (|5.ip . namely, to 



and 



A 2 = (^^) -\ ( ] + + (5-12) 
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Letting Wi = (X t + X 2 )/2, W 2 = (X 1 - X 2 )/2, W 3 = Yi, and W 4 = Z u we have 



(Ai, Ai + A 2 ) = (Wi + y/w$ + W$ + W%, 2W^j 



W 1 - PyW 2 j + 2 [P^W 2 + - V Wj + Wi + Wf), 
2(W 1 -p^W 2 ] + 2p^-W 2 
W 1 -p^W 2 \l,2) 

<?2 J 

+ (p^ W 2 + + Wf + W 4 2 , 2/5g^ 2 ) , (5.13) 

where a\ = (erf + 2/5<7i<7 2 + er 2 )/4, a| = (af - 2pd\d 2 + cr|)/4, and p<7 2 <7| = (of - <7 2 )/4. Noting that 
the variance of W\ — {po\j a 2 )W 2 is a\(\ — p 2 ) = af(l — p 2 ), and that, moreover, (3 := p<j\j2o 2 = 
(a 2 — a|)/(2- v /crf — 2pa 1 G 2 + of), we find that 



(Ai, Ax + A 2 ) = ai VT^Z(1, 2) + ( 2 (j 2 [/(g) , 2p^B 2 {l)) 
= en Vl - P 2 ^(l,2) + o- 2 ([/(/3),4/3i? 2 (l)) 

= (5-14) 

and we have our identity in law. ■ 

The preceding theorem marks a first step in describing the increasingly robust connection between 
Gaussian random matrices and maximal functionals of several Brownian motions. We next turn to 
another very striking relationship between a different class of 2 x 2 Gaussian random matrices and a 
maximal functional of two independent Brownian bridges. As we will see, this result is, in fact, more 
fundamental than the connections noted thus far in that it furnishes a conditional equivalence in law 
from which many by-now classical results may be derived. 

By a standard Brownian bridge terminating at b we mean a process (B(t; 6))o<t<i equal in law to 
(B(t) — tB(l) + bt)o< t <i, where B is a standard Brownian motion. 

Theorem 5.2 Let {B l {t; &i))o<t<i and {B 2 (t; b 2 ))o<t<i be two independent standard Brownian bridges 
terminating at b\ and b 2 , respectively, and let 

M»:=( yi !' iZl Y 't Zl )' <^> 

where b\ and b 2 are real constants, while Y\ and Z\ are independent centered normal random variables 
with variance 1/2. Then the largest eigenvalue of has the same law as 

V bb := max ((B^&i) - B 1 ^;^)) + (B 2 (l;b 2 ) - B 2 {t;b 2 ))). (5.16) 
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Proof. We first simplify the familiar-looking expression (|5,16p and obtain 



V bb := max ((B^t; h) - B\0; h)) + (B 2 (l; b 2 ) - B 2 (t; b 2 ))) 
= max i {B 1 (t;b 1 ) + (b 2 -B 2 (t;b 2 ))) 

= b 2 + max (S 1 ^; 61) - B 2 (t;b 2 )). (5.17) 

Since a linear combination of two independent Brownian bridges is again a Brownian bridge, by 
elementary rescaling we obtain the equality in law 

V bb = 6 2 + \/2maxB ft; h AzJ//l ) ( 5 .1 8 ) 
0<t<i V y/2 J 

where B is a standard Brownian bridge terminating at {b\ — b 2 )/\/2. 

Next, recall the elementary result (which may be easily obtained from the Reflection Principle) 
stating that the maximum M(b) of a standard Brownian bridge terminating at b has a distribution 
given by 

P(Af (6) > a) = exp (-2a(a - &)), 

for any a > max(6, 0). 

Applying this result to (|5.18p . we find that 



P(Vn, > a) = 

= exp I —2 



max B t; ^— > — 

\o<*<i V y/2 J 



b 



V2 

a — b 2 \ / a — b 2 b\ 



y/2 J V V2 y/2 
exp(-(a-fe 1 )(a-6 2 )), (5.19) 



for all a > max(6i, 62)- 

To complete the proof of the claim, we simply note that the eigenvalues of M bb satisfy the charac- 
teristic equation 

(A-& 1 )(A-6 2 )-y 1 2 -Z 1 2 = 0. 

Writing Ai for the largest eigenvalue of M bb , we see that the event {Ai > a}, for a > max(6i, b 2 ), may 
be rewritten as the event {Y 2 + Z\ > (a — 6i)(a — b 2 )}. But Y 2 + Zf is distributed as a (rescaled) 
Chi-squared random variable with two degrees of freedom and is in fact exponential with parameter 
1. Thus, 

P(Ai > a) = P(y x 2 + Z\ >{a- h)(a - b 2 )) 

= exp(-(a-b 1 )(a-b 2 )), (5.20) 

and our claim is proved. ■ 
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Remark 5.1 The significance of Theorem 15,^1 lies in the fact that we can de-condition b\ and b 2 to 
obtain again the connection between maximal Brownian functionals and both the GUE or the traceless 
GUE, as shown next. Indeed, replace b\ and b 2 by iid standard normal random variables X\ and X 2 
which are independent of(Yi,Zi) in the random matrix setting. Then, the joint density of (Vy,, Xi, X 2 ) 
is given by 

g 3 (v, xi,x 2 ) = (2v - x\ - x 2 )(p(xi)(p(x 2 ) exp (-(v - xi){v - x 2 )), (5.21) 

for Xi,x 2 G K and v > max(xi,X2), where 92 is the standard normal density. 

Let us first examine the constant-trace case by letting u>\ = x\ + x 2 and w 2 = x\ — x 2 and then 
conditioning the joint law in (|5.2ip on w%. An elementary calculation shows that the conditional 
density is simply 

h(y\w x ) = ±(v-^)\ W (v - ^) 2 , (5.22) 



for v > w\/2. In particular, for w\ = 0, we do indeed recover the traceless case: 



4 



h(v\wi = 0) = -^=v 2 e~ v \ (5.23) 
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for v > 0, which is the density associated with the square root of a (rescaled) Chi-squared random 
varible with three degrees of freedom, as expected. 

Next, let us return to the joint density in (|5.2ip and successively integrate out x\ and x 2 . Writing 
<3? for the standard normal cumulative distribution function, we find that the joint density of (Vbb,X 2 ) 
is given by 

g 2 {v, x 2 ) = ip(v) (<p(x 2 ) + v$(x 2 )) , (5.24) 

for —00 < x 2 < v < +00. Integrating once more, we obtain the density of the largest eigenvalue of the 
2x2 GUE, 

9l (v) = <p(v) ((1 + v 2 )$(v) + v<p(v)) , (5.25) 

for all v € R. We have thus recovered the traceless and standard GUE distributions from our condi- 
tional result. 

The results above are all ultimately based on the trivial fact that we can solve a quadratic equation. 
While larger random matrices do not in general yield to such straightforward analyses, there are 
nonetheless certain classes that can be linked to distributions of maximal Brownian functionals. The 
following development consolidates several of these and leads to a more general conjecture. 

Let B be m-dimensional standard Brownian motion and introduce the notation 



diag(vl) 

for any m x m matrix A and 



( ^ \ 

m.m) 
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1- 




■') 


\v m J 






V m J 



for any vector t> = . . . , v m ) T . Now recall the Brownian functional introduced by Glynn and Whitt 
[13] . namely, 

m 

£> m = max^S fe (At fc ), 
fc=i 

where the maximum is taken over the usual Weyl chamber, and from results of Baryshnikov [4J and 
Gravner, Tracy, and Widom |15| . D m = A max (M), for M an m x m element of the GUE. To extend 
this identity to a broader class of random matrices, first let A = al + l m 6 T , where a € R, 6 £ M m , 
and where l m = (1, 1, ... , 1) T S M m . Next, letting B = AB, the corresponding maximal Brownian 
functional associated with B is such that 



An := maxJ2B k (At k ) 

k=l 

ml m 

= max^ aB k (At k ) +^bjB 3 (At k ) 



k=l 



3=1 



j=l fc=i 
aD™ + 6 T B(1) 

(M) + b T diag(M) 
(oM + 6 T diag(M)J) 
(M) 



Ax 
A, 



where M = aM + b T diag(M)I. 

This rank-one perturbation result, when combined with Theorem 13.31 or Corollary 13.11 and taking 
a = 1 and bi = — 1/m, i = 1, . . . ,m, recovers [28j only using combinatorial and probabilistic techniques 
and therefore bypassing the analytical ones present there. It also shares, as shown next, its amenability 
to analysis with two further cases: the cyclic case for m = 2 or 3, and also the case A = aP, where P 
is an arbitrary permutation matrix and a an arbitrary real. Write the general m x m cyclic matrix A 

/0 1\ 

, i o .: 

as A = J^fcLo a kP i where P is the permutation matrix P 



1 



VO 



Such a cyclic A also 



i oy 
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yields a cyclic covariance matrix E := AA T , and moreover, 

mm mm m—1 m 

D m := maxJ^JTAjy.B^Atfc) = max^ a k ^{At k ) = max ^ o n B k - r (At k ). 

k=lj=l k=lj=l r=0 k=l 

We claim that for all three cases above, 

(X 1 \ \ 



max 



V X m J J 



where / is an as-yet unspecified function of, cr(A), the spectrum of A, (X\, . . . , X m ) T ~ iV(0, E), 
E = AA T , and M is a " GUE matrix" with all diagonal entries of 0. Writing M = f(a(A))M + 
(X 1 \ 

, we then have D m = A max (M). Let us first look at the simple case of A = aP, 

V o x m J 

where P is an arbitrary permutation matrix with Pij = <5i, CT (j), a a permutation of {1,2,..., m}. Then 

/X 1 \ 

B = aPB = aB, and so D m = max £™ =1 aB a ^(At k ) = aD m , while M = aM + 

V o xj 

where (Xi, . . .,X m ) T ~ N(0, E), E = AA T = a 2 /. But since M = aM + a Diag(diag(M)) = aM, 

c 

M 6 GUE, we simply recover the base case D m = aA max (M) and see that /(a(A)) = a. 

Next, consider the degenerate cyclic case where cto = cti = • ■ • = «m-i = a > so that A = al m l^. 
But this is just a special case of the rank-one perturbation A = al + l rn b T ', so that £> m = al^S(l) ~ 
iV(0,ma 2 ), and M = 0M + al^(X 1; . . . , X m ) T / = oEL^/, so that A max (M) = a^™ =1 X fc ~ 
N(0,ma 2 ). In this pure Gaussian case, f(a(A)) = 0. 

More generally, as previously shown, for A = al + l m b T , 

Dm = A max (a M + {b T l m ) diag(M)J) 

= A max (aM + (a + b T l m ) diag(M)/) 
= A max (aM + Adiag(M)/), 

and since Adiag(M) ~ JV(0, E), /(<r(A)) = a. 

Let us now look at the low-dimensional cyclic cases in turn. For the 2x2 cyclic case, let A = 



clqI + cti ^ ^ = (a - ai)J + ail 2 l^. Then 

M = (a - ai)M + aiLj diag(M)/ 



= (ao - oi)M + (ao - ai) Diag(diag(M)) + aiLj diag(M)/ 
= (a - ai)M + Diag(((a - ai)J + a^h) diag(M)) 
= (a - Oi)M + Diag(Adiag(M)), 
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and here Adiag(M) ~ N(0, E) and so f(o~(A)) = ao — a%. Finally, consider the 3-dimensional cyclic 

(ao a 2 aA 
ai a a 2 , then E = A4 T = ^]fe=i a fc^ + Sfc^£ a ft°f(l3l3 ~ I) = (Y.k=i a k ~ 
a 2 ai a J 

Y^k^e a k a e)I + (Y.k^e a k a i)^3 '■ But we can also write E = (a/ + /Sl^lg^a/ + /31 3 l3) T = a 2 I + 
(2a/3 + S/S 2 )^. Thus a 2 = ELi a fc " E fe ^«fe^> therefore 3/3 2 + 2a/3 = a^a^ and so 

/3 = -a/3 ± l/3yjYlk=i a l ~ ^Ylk^i a k a i- Now > take a = \jYll=i a l ~ Y^k^e a k a i and P = -a/3 + 
yXJfcli a fc ~ 3 a k a e /3. But, Z) 3 = A max (aM + Diag(a/ + /31 3 lf) diag(M)), so that al + 

/31 3 l 3 P diag(M) ~ iV(0,E), where f(a(A)) = a. Noting that M and the matrix 

are independent, we can summarize all the above results in the following theorem: 

Theorem 5.3 Let B = {B{t))o<t<\ be a standard m- dimensional Brownian motion, A an m x m 
real matrix, and B = AB, so that B has covariance matrix St = AA T t. Then, in the cases noted 
in the table below, the maximal functional D m = max J2T=i B h (Atk) is equal in law to the largest 
eigenvalue of an m x m Hermitian Gaussian random matrix having diagonals distributed as N(0, E) 
and off-diagonals with total variance given by f(a(A)), independent of each other and of the diagonal 
entries. 



Case 


a(A) 




m x m, A = aP, a E R, P perm. 


{a, a, . . . ,a} 


a 


m x m, A = al + lmb 1 


{a + b T l m ,a,...,a} 


a 


2x2 cyclic A = X^ =0 a k P k 


{a + ai,a - ai} 


a - ai 


3x3 cyclic A = Yll=o a k P h 


{a + ai + a 2 , 


«= \/El=iO- 2 k -Ek^£ a kae 




ao + a\u + a 2 u 2 


= ao + a\u + a 2 u 2 \ 




ao + a\u 2 + a 2 u}, 
u = e 2iw / 3 


= \ao + a\u 2 + a 2 u\ 




6 Concluding Remarks 

In this paper, we have obtained the limiting shape of Young diagrams generated by an aperiodic, irre- 
ducible, homogeneous Markov chain on a finite state alphabet. The following remarks indicate natural 
directions in which our results in some cases can, and in other cases, may hope to, be extended. 

• Our limiting theorems have all been proved assuming that the initial distribution is the stationary 
one. However, such results as Theorem 2 of Derriennic and Lin [10] allow to extend our framework 
to initial distributions started at a specified state. Indeed, in this case, i.e., if for some k = l,...,m, 
P(Xo = a k ) = 1, the asymptotic covariance matrix is still given by (|2.14p . and, for example, Theorem 
13.21 remains valid. For an arbitrary initial distribution, what is needed in this non-stationary context 
is an invariance principle. More generally, our results continue to hold for k th -oxder Markov chains, 
and in fact, they extend to any sequence for which both an asymptotic covariance matrix and an 
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invar iance principle exist. 

• Our limiting theorems have only been proved for finite alphabets. However, from the authors' 
previous work [T7j, it is known that for countably infinite iid alphabets, LI n has a limiting law corre- 
sponding to that of a non-uniform, finite-alphabet. Hence, for a countably infinite-alphabet Markov 
chain (subject to additional constraints), we might still be able to obtain limiting laws of the form 
developed in this paper. 

• By using appropriate existing concentration inequalities, one can expect to establish the conver- 
gence of the moments of the rows of the diagrams. 

• One field in which the connection between Brownian functionals and random matrix theory has 
been exploited is in Queuing Theory. The development below, following O'Connell and Yor [25], shows 
how Brownian functionals of the sort we have studied arise as generalizations of standard queuing 
models. 

Let A(s,t] and S(s,t], — oo < s < t < oo, be two independent Poisson point process on R, with 
intensity measures A and /u, respectively, with < A < [i. Here A represents the arrivals process, and 
S the service time process, at a queue consisting of a single server. The condition A < fi ensures that 
the queue length 

Q(t)= sup {A(s,t]-S(s,t}}, (6.1) 

— oo<s<t 

is a.s. finite, for any t£R. Then, defining the departure process 

D(s,t] = A(s,t}-(Q(t)-Q(s)), (6.2) 

which is simply the number of arrivals during (s,t] less the change in the queue length during (s,t], 
the classical problem is to determine the distribution of D(s,t]. The answer to this problem is given 
by Burke's Theorem [8] (see Theorem 1 of [25]): 

Theorem 6.1 D is a Poisson process with intensity X, and {D(s, t], s < t} is independent of{Q(s), s > 
t}. 

That is, D has the same law as the arrivals process A. Moreover, since, the queue length after time 
t is independent of the process D up to time t, one may take the departures from the first queue and 
use them as inputs to a second queue, and observe that the departure process from the second queue 
also has the law of A. Proceeding in this way, one generalizes to a tandem queue of n servers, each 
taking the departures from the previous queue as its arrivals process. 

One can further generalize this model to a Brownian queue in tandem in the following manner. 
Let B, B l ,B 2 , . . . , B n be independent, standard Brownian motions on M, and write B k (s, t) = B k (t) — 
B k (s), for each k and s < t, and similarly for B. Let m > be a constant, and define, in complete 
analogy to (I6.ip and (16. 2p . 

qi{t)= sup {B(s,t) + B 1 (s,t)-m(t-s)} , (6.3) 

— oo<s<t 

and, for s < t, 



49 



di(s,t) = B(s,t) - ( qi (t) - gi (s)). (6.4) 

For k = 2, 3, . . . , n, let 

g fc (t)= sup {4-i(s,t) + £ fc (s,t)-m(t-s)), (6.5) 

and, for s < t, 



d k (s, t) = dk-i(s, t) - (q k (t) - q k (s)). (6.6) 

Here B is the arrivals process for the first queue, d k -\ is the arrivals process for the k th queue 
(k > 2), and mi — B k (t) is the service process for the k th queue, for all k. Using the ideas employed 
in Burke's Theorem, it can be shown that the generalized queue lengths <zi(0), ^(O), ... , q n (0) are iid 
random variables. Moreover, they are exponentially distributed with mean 1/m. 

Using the definitions in (|6.3p - (|6.6p . and a simple inductive argument, one finds that 



V%(0) = sup{ J B(-t,0) -mt + L n (t)\, (6.7) 

where 



L n (t)= sup {B 1 (-t,-s n - 1 ) + --- + B n (-s 1 ,0)}. (6.8) 

0<si<- 
<s m -i<t 

By Brownian rescaling, we observe that 



L n (t) = Vt sup {B\-l,-s n _ 1 ) + --- + B n (-s 1 ,0)} = VtV^, (6.9) 
0<si<- 

<S m _l<l 

where the functional is as in Theorem I3.2| with associated n x n covariance matrix S = tl n and 
parameter set I\^ n . Thus, L n (t) may be thought of as a process version of this V^. 

The generalized Brownian queues in (|6.3p - (j6.6p involved independent Brownian motions. These can 
be replaced by Brownian motions B , . . . , B n for which (o"ii? 1 (t), . . . , a n B n (t)) has (nontrivial) covari- 
ance matrix tT,. Whether or not we keep the initial arrival process B independent of (B ,. . . ,B n ), we 
no longer have that <?i(0), 92(0), . . . ,q n (0) are iid random variables, due to the dependence among the 
service times mt — B k (t), but we do still have the identity (j6.7|) and ()6.9[) relating the total occupancy 
of the queue at time zero to V^. More importantly, our generalizations of the Brownian functionals 
L n (t) above can be used to describe the joint law of the input/output of each queue. 

• An important topic connecting much of random matrix theory to other problems, such as the shape 
of random Young diagrams, is the field of orthogonal polynomials. (See, e.g., |22j.) It would be of 
great interest to see what, if any, classes of orthogonal polynomials are associated with the present 
paper. 
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